(57) Abstract

Methods for the production of purified, catalytically active, recombinant memapsin 2 have been developed. The substrate and
subsite specificity of the catalytically active enzyme have been determined. The substrate and subsite specificity information
was used to design substrate analogs of the natural memapsin 2 substrate that can inhibit the function of memapsin 2. The
substrate analogs are based on peptide sequences shown to be related to the natural peptide substrates for memapsin 2. The
substrate analogs contain at least one analog of an amide bond which is not capable of being cleaved by memapsin 2. Processes
for the synthesis of two substrate analogues including isosteres at the sites of the critical amino acid residues were developed
and the substrate analogues, OM99-1 and OM99-2, were synthesized. OM99-2 is based on an octapeptide Glu-Val-Asn-Leu-Ala-
Ala-Glu-Phe (SEQ ID NO:28) with the Leu-Ala peptide bond substituted by a transition-state isostere hydroxyethylene group (Fig. 1).
The inhibition constant of OM99-2 is 1.6 × 10⁻⁹ M against recombinant pro-memapsin 2. Crystallography of memapsin 2 bound to
this inhibitor was used to determine the three dimensional structure of the protein, as well as the importance of the various
residues in binding. This information can be used by those skilled in the art, with commercially available software programs
and techniques familiar to those in organic chemistry and enzymology, to design new inhibitors of memapsin 2 that are useful
in diagnostics and for the treatment and/or prevention of Alzheimer's disease.

(54) Title: CATALYTICALLY ACTIVE RECOMBINANT MEMAPSIN AND METHODS OF USE THEREOF

CATALYTICALLY ACTIVE RECOMBINANT MEMAPSIN
AND METHODS OF USE THEREOF

Background of the Invention 

This invention is in the area of the expression of catalytically active
Memapsin 2 (beta-secretase) and use thereof in the design and screening of
specific inhibitors which are useful in the treatment and/or prevention of 
Alzheimer's Disease. 

Alzheimer's disease (AD) is a degenerative disorder of the brain first
described by Alois Alzheimer in 1907 after examining one of his patients
who suffered drastic reduction in cognitive abilities and had generalized
dementia (The early story of Alzheimer's Disease, edited by Bick et al.
(Raven Press, New York 1987)). It is the leading cause of dementia in
elderly persons. AD patients have increased problems with memory loss and
intellectual functions which progress to the point where they cannot function
as normal individuals. With the loss of intellectual skills the patients exhibit
personality changes, socially inappropriate actions and schizophrenia (A
Guide to the Understanding of Alzheimer's Disease and Related Disorders,
edited by Jorm (New York University Press, New York 1987)). AD is
devastating for both victims and their families, for there is no effective
palliative or preventive treatment for the inevitable neurodegeneration.

AD is associated with neuritic plaques measuring up to 200 μm in
diameter in the cortex, hippocampus, subiculum, hippocampal gyrus, and
amygdala. One of the principal constituents of neuritic plaques is amyloid,
which is stained by Congo Red (Fisher (1983); Kelly, Microbiol. Sci.
1(9):214-219 (1984)). Amyloid plaques stained by Congo Red are
extracellular, pink or rust-colored in bright field, and birefringent in 
polarized light. The plaques are composed of polypeptide fibrils and are 
often present around blood vessels, reducing blood supply to various neurons 
in the brain. 

Various factors such as genetic predisposition, infectious agents,
toxins, metals, and head trauma have all been suggested as possible
mechanisms of AD neuropathy. Available evidence strongly indicates that
there are distinct types of genetic predispositions for AD. First, molecular
analysis has provided evidence for mutations in the amyloid precursor
protein (APP) gene in certain AD-stricken families (Goate et al., Nature
349:704-706 (1991); Murrell et al., Science 254:97-99 (1991); Chartier-
Harlin et al., Nature 353:844-846 (1991); Mullan et al., Nature Genet. 1:345-
347 (1992)). Additional genes for dominant forms of early onset AD reside
on chromosome 14 and chromosome 1 (Rogaev et al., Nature 376:775-778
(1995); Levy-Lahad et al., Science 269:973-977 (1995); Sherrington et al.,
Nature 375:754-760 (1995)). Another locus associated with AD resides on
chromosome 19 and encodes a variant form of apolipoprotein E (Corder,
Science 261:921-923 (1993)).

Amyloid plaques are abundantly present in AD patients and in
Down's Syndrome individuals surviving to the age of 40. The
overexpression of APP in Down's Syndrome is recognized as a possible
cause of the development of AD in Down's patients over thirty years of age
(Rumble et al., New England J. Med. 320:1446-1452 (1989); Mann et al.,
Neurobiol. Aging 10:397-399 (1989)). The plaques are also present in the
normal aging brain, although at a lower number. These plaques are made up
primarily of the amyloid β peptide (Aβ; sometimes also referred to in the
literature as β-amyloid peptide or β peptide) (Glenner and Wong, Biochem.
Biophys. Res. Comm. 120:885-890 (1984)), which is also the primary protein
constituent in cerebrovascular amyloid deposits. The amyloid is a
filamentous material that is arranged in beta-pleated sheets. Aβ is a
hydrophobic peptide comprising up to 43 amino acids.

The determination of its amino acid sequence led to the cloning of the
APP cDNA (Kang et al., Nature 325:733-735 (1987); Goldgaber et al.,
Science 235:877-880 (1987); Robakis et al., Proc. Natl. Acad. Sci. 84:4190-
4194 (1987); Tanzi et al., Nature 331:528-530 (1988)) and genomic APP
DNA (Lemaire et al., Nucl. Acids Res. 17:517-522 (1989); Yoshikai et al.,
Gene 87:257-263 (1990)). A number of forms of APP cDNA have been
identified, including the three most abundant forms, APP695, APP751, and
APP770. These forms arise from a single precursor RNA by alternate
splicing. The gene spans more than 175 kb with 18 exons (Yoshikai et al.
(1990)). APP contains an extracellular domain, a transmembrane region and
a cytoplasmic domain. Aβ consists of up to 28 amino acids just outside the
hydrophobic transmembrane domain and up to 15 residues of this
transmembrane domain. Aβ is normally found in brain and other tissues
such as heart, kidney and spleen. However, Aβ deposits are usually found in
abundance only in the brain.

Van Broeckhoven et al., Science 248:1120-1122 (1990), have
demonstrated that the APP gene is tightly linked to hereditary cerebral
hemorrhage with amyloidosis (HCHWA-D) in two Dutch families. This was
confirmed by the finding of a point mutation in the APP coding region in two
Dutch patients (Levy et al., Science 248:1124-1128 (1990)). The mutation
substituted a glutamine for glutamic acid at position 22 of the Aβ (position
618 of APP695, or position 693 of APP770). In addition, certain families are
genetically predisposed to Alzheimer's disease, a condition referred to as
familial Alzheimer's disease (FAD), through mutations resulting in an amino
acid replacement at position 717 of the full length protein (Goate et al.
(1991); Murrell et al. (1991); Chartier-Harlin et al. (1991)). These mutations
co-segregate with the disease within the families and are absent in families
with late-onset AD. This mutation at amino acid 717 increases the
production of the Aβ1-42 form of Aβ from APP (Suzuki et al., Science
264:1336-1340 (1994)). Another mutant form contains a change in amino
acids at positions 670 and 671 of the full length protein (Mullan et al.
(1992)). This mutation to amino acids 670 and 671 increases the production
of total Aβ from APP (Citron et al., Nature 360:672-674 (1992)).
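
The residue numbers quoted above for the same site in different numbering schemes differ by fixed offsets. The following is a minimal sketch of that conversion, assuming only the offsets implied by the Dutch mutation (position 22 of the amyloid beta peptide is position 618 of APP695 and position 693 of APP770). The names ABETA_OFFSET and abeta_to_app are illustrative and do not come from the source.

```python
# Offsets implied by the text: amyloid-beta position 22 corresponds to
# position 618 of APP695 and position 693 of APP770.
ABETA_OFFSET = {
    "APP695": 618 - 22,
    "APP770": 693 - 22,
}


def abeta_to_app(position, isoform="APP770"):
    """Convert an amyloid-beta residue number to APP isoform numbering.

    Hypothetical helper for illustration; it simply adds a fixed offset.
    """
    return position + ABETA_OFFSET[isoform]


# The Dutch mutation in both isoform numberings, then amyloid-beta residue 1.
print(abeta_to_app(22, "APP695"), abeta_to_app(22, "APP770"))
print(abeta_to_app(1, "APP695"), abeta_to_app(1, "APP770"))
```

Under these offsets, residue 1 of the amyloid beta peptide corresponds to position 672 in APP770 numbering, so the mutated positions 670 and 671 lie immediately N-terminal to the start of the peptide, assuming the "full length protein" numbering in the text is that of APP770.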

APP is processed in vivo at three sites. The evidence suggests that
cleavage at the α-secretase site by a membrane associated metalloprotease is
a physiological event. This site is located in APP 12 residues away from the
lumenal surface of the plasma membrane. Cleavage of the β-secretase site
(28 residues from the plasma membrane's lumenal surface) and the γ-
secretase site (in the transmembrane region) results in the 40/42-residue β-
amyloid peptide (Aβ), whose elevated production and accumulation in the
brain are the central events in the pathogenesis of Alzheimer's disease (for
review, see Selkoe, D.J., Nature 399:23-31 (1999)). Presenilin 1, another
membrane protein found in human brain, controls the hydrolysis at the APP
γ-secretase site and has been postulated to be itself the responsible protease
(Wolfe, M.S. et al., Nature 398:513-517 (1999)). Presenilin 1 is expressed as
a single chain molecule and its processing by a protease, presenilinase, is
required to prevent it from rapid degradation (Thinakaran, G. et al., Neuron
17:181-190 (1996) and Podlisny, M.B., et al., Neurobiol. Dis. 3:325-37
(1997)). The identity of presenilinase is unknown. The in vivo processing of
the β-secretase site is thought to be the rate-limiting step in Aβ production
(Sinha, S. & Lieberburg, I., Proc. Natl. Acad. Sci. USA 96:11049-11053
(1999)), and is therefore a strong therapeutic target.

The design of inhibitors effective in decreasing amyloid plaque
formation is dependent on the identification of the critical enzyme(s) in the
cleavage of APP to yield the 42 amino acid peptide, the Aβ1-42 form of Aβ.
Although several enzymes have been identified, it has not been possible to
produce active enzyme. Without active enzyme, one cannot confirm the
substrate specificity, determine the subsite specificity, nor determine the
kinetics or critical active site residues, all of which are essential for the
design of inhibitors.



Memapsin 2 has been shown to be beta-secretase, a key protease
involved in the production in human brain of beta-amyloid peptide from
beta-amyloid precursor protein (for review, see Selkoe, D.J., Nature 399:23-
31 (1999)). It is now generally accepted that the accumulation of beta-
amyloid peptide in human brain is a major cause of Alzheimer's
disease. Inhibitors specifically designed for human memapsin 2 should
inhibit or decrease the formation of beta-amyloid peptide and the
progression of Alzheimer's disease.

Memapsin 2 belongs to the aspartic protease family. It is
homologous in amino acid sequence to other eukaryotic aspartic proteases
and contains motifs specific to that family. These structural similarities
predict that memapsin 2 and other eukaryotic aspartic proteases share a
common catalytic mechanism (Davies, D.R., Annu. Rev. Biophys. Chem. 19,
189 (1990)). The most successful inhibitors for aspartic proteases are mimics
of the transition state of these enzymes. These inhibitors have substrate-like
structure with the cleaved planar peptide bond between the carbonyl carbon
and the amide nitrogen replaced by two tetrahedral atoms, such as
hydroxyethylene [-CH(OH)-CH2-], which was originally discovered in the
structure of pepstatin (Marciniszyn et al., 1976).

However, for clinical use, it is preferable to have small molecule
inhibitors which will pass through the blood brain barrier and which can be
readily synthesized. It is also desirable that the inhibitors are relatively
inexpensive to manufacture and that they can be administered orally.
Screening of thousands of compounds for these properties would require an
enormous effort. To rationally design memapsin 2 inhibitors for treating
Alzheimer's disease, it will be important to know the three-dimensional
structure of memapsin 2, especially the binding mode of an inhibitor in the
active site of this protease.

It is therefore an object of the present invention to provide purified, 
recombinant, and active memapsin 2, as well as its substrate and subsite 
specificity and critical active site residues. 

It is a further object of the present invention to provide compositions 
and methods for synthesis of inhibitors of memapsin 2. 

It is a still further object of the present invention to provide
compositions that can cross the blood brain barrier (BBB) and interact with
memapsin 2 or its substrate to inhibit cleavage by memapsin 2.

It is therefore an object of the present invention to provide means for
rational design and screening of compounds for inhibition of memapsin 2.

Summary of the Invention 

Methods for the production of purified, catalytically active,
recombinant memapsin 2 have been developed. The substrate and subsite
specificity of the catalytically active enzyme have been determined. The
active enzyme and assays for catalytic activity are useful in screening
libraries for inhibitors of the enzyme.
The substrate and subsite specificity information was used to design
substrate analogs of the natural memapsin 2 substrate that can inhibit the
function of memapsin 2. The substrate analogs are based on peptide
sequences shown to be related to the natural peptide substrates for
memapsin 2. The substrate analogs contain at least one analog of an amide
(peptide) bond which is not capable of being cleaved by memapsin 2.
Processes for the synthesis of two substrate analogues including isosteres at
the sites of the critical amino acid residues were developed and the substrate
analogues, OM99-1 and OM99-2, were synthesized. OM99-2 is based on
an octapeptide Glu-Val-Asn-Leu-Ala-Ala-Glu-Phe (SEQ ID NO:28) with
the Leu-Ala peptide bond substituted by a transition-state isostere
hydroxyethylene group. The inhibition constant of OM99-2 is 1.6 × 10⁻⁹ M
against recombinant pro-memapsin 2. Crystallography of memapsin 2
bound to this inhibitor was used to determine the three dimensional
structure of the protein, as well as the importance of the various residues in
binding.

This information can be used by those skilled in the art to design
new inhibitors, using commercially available software programs and
techniques familiar to those in organic chemistry and enzymology.
For example, the side chains of the inhibitors may be
modified to produce stronger interactions (through hydrogen bonding,
hydrophobic interaction, charge interaction and/or van der Waals interaction)
in order to increase inhibition potency. Based on this type of information,
the residues with minor interactions may be eliminated from the new
inhibitor design to decrease the molecular weight of the inhibitor. The side
chains with no structural hindrance from the enzyme may be cross-linked to
lock in the effective inhibitor conformation. This type of structure also
enables the design of peptide surrogates which may effectively fill the
binding sites of memapsin 2 yet produce better pharmaceutical properties.

The examples demonstrate the production of catalytically active
enzyme, design and synthesis of inhibitors, and how the crystal structure was
obtained. The examples thereby demonstrate how the methods and materials
described herein can be used to screen libraries of compounds for other
inhibitors, as well as for design of inhibitors. These inhibitors are useful in
the prevention and/or treatment of Alzheimer's disease as mediated by the
action of the beta-secretase, memapsin 2, in cleaving APP.

Brief Description of the Drawings 

Figure 1 depicts the plasmid construct of vector pET-11a-memapsin
2-T1 and pET-11a-memapsin 2-T2. The T7 promoter, amino acid sequence
from the vector (T7 protein) (SEQ ID NO:3), and the beginning and ending
of the memapsin 2 T1 and T2 construct are shown. Construct promemapsin
2-T1 was used in the preparation of protein for crystallization and includes
residues 1v-15v which are derived from vector pET-11a. Residues 1p-48p
are putative pro-peptide. Residues 1-393 correspond to the mature protease
domain and C-terminal extension. The residue numbering of memapsin 2
starts at the aligned N-terminal position of pepsin (Figure 3).

Figure 2A is a graph of the initial rate of hydrolysis of synthetic
peptide swAPP (see Table 1) by M2pd at different pH. Figure 2B is a graph
of the relative kcat/Km values for steady-state kinetics of hydrolysis of peptide
substrates by M2pd.

Figures 3A and 3B are the chemical structures of memapsin 2
inhibitors, OM99-2 and OM99-1.

Figure 4A is a graph of the inhibition of recombinant memapsin 2 by
OM99-1. Figure 4B is a graph of the inhibition of recombinant memapsin 2
by OM99-2.

Figures 5A-E are photographs of crystals of recombinant memapsin 
2-OM99-2 complex. 

Figure 6 is a stereo view of the crystal structure of memapsin 2 protease
domain with bound OM99-2. The polypeptide backbone of memapsin 2 is
shown as a ribbon diagram. The N-lobe and C-lobe are blue and yellow,
respectively, except the insertion loops (designated A to G, see Figure 2) in
the C-lobe are magenta and the C-terminal extension is green. The inhibitor
bound between the lobes is shown in red.

Figure 7 is a stereo view of comparison of the three-dimensional
structures of memapsin 2 and pepsin. The molecular surface of the former is
significantly larger by the insertion of surface loops and helix and the C-
terminal extension. Chain tracing of human memapsin 2 is dark blue and is
grey for human pepsin. The light blue balls represent identical residues
which are topologically equivalent. The disulfide bonds are shown in red for
memapsin 2 and orange for pepsin. The C-terminal extension is in green.

Figure 8 is a schematic presentation of interaction between OM99-2
and memapsin 2 protease domain. The S3' and S4' subsites are not defined.

Figure 9 is a stereo presentation of interactions between inhibitor
OM99-2 (orange) and memapsin 2 (light blue). Nitrogen and oxygen atoms
are marked blue and red, respectively. Hydrogen bonds are indicated in
yellow dotted lines. Memapsin 2 residues which comprise the binding
subsites are included. Residues P4, P3, P2, P1 and P1' (defined in Figure 8) of
OM99-2 are in an extended conformation. Inhibitor chain turns at residue
P2' which makes a distinct kink at this position. The backbone of P3' and P4'
directs the inhibitor to exit the active site.

Figure 10 shows schematics of the cross linking between P3 Val and P1
Leu side chains in the design of new inhibitors for memapsin 2 based on the
current crystal structure. R and R' at positions P2 and P1' indicate amino
acid side chains. Other structural elements of inhibitor are omitted for
clarity.

Figure 11 shows schematics of the cross linking between P4 Glu and P2
Asn side chains in the design of new inhibitors for memapsin 2 based on the
current crystal structure. R at position P3 indicates amino acid side chain.
Other structural elements of inhibitor are omitted for clarity.

Figure 12 is a schematic of the design for the side chain at the P1'
subsite for the new memapsin 2 inhibitors based on the current crystal
structure. Arrows indicate possible interactions between memapsin 2 and
inhibitor. Other structural elements of inhibitor are omitted for clarity.

Figure 13 is a schematic of the design of two six-membered rings in
the inhibitor structure by the addition of atoms A and B. The ring formation
involves the P1-Leu side chain and the peptide backbone near P1, P2, and P3.
The new bonds are in dotted lines. A methyl group can be added to the beta-
carbon of P1-Leu. Other structural elements of inhibitor are omitted for
clarity.
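
The P and P' labels used in the figure descriptions above follow the usual protease convention: residues are numbered outward from the scissile bond, with P1 on its N-terminal side and P1' on its C-terminal side. The following is a minimal sketch of that labeling, assuming this convention and taking the Leu-Ala bond of the octapeptide on which OM99-2 is based as the P1-P1' position. The function name label_subsites is illustrative only.

```python
def label_subsites(residues, scissile_index):
    """Label residues Pn..P1 and P1'..Pn' around a scissile bond.

    scissile_index is the index of the first residue on the C-terminal
    (primed) side of the bond. Hypothetical helper for illustration.
    """
    labels = {}
    for i, residue in enumerate(residues):
        if i < scissile_index:
            labels[f"P{scissile_index - i}"] = residue
        else:
            labels[f"P{i - scissile_index + 1}'"] = residue
    return labels


# Octapeptide on which OM99-2 is based; the Leu-Ala bond sits between
# index 3 (Leu) and index 4 (Ala).
om99_2_parent = ["Glu", "Val", "Asn", "Leu", "Ala", "Ala", "Glu", "Phe"]
labels = label_subsites(om99_2_parent, scissile_index=4)
for name, residue in labels.items():
    print(name, residue)
```

With this assignment the labels agree with the pairing named for Figure 11 (P4 Glu and P2 Asn).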

Detailed Description of the Invention
I. Preparation of Catalytically Active Recombinant Memapsin 2

Cloning and Expression of Memapsin 2
Memapsin 2 was cloned and the nucleotide (SEQ ID NO. 1) and
predicted amino acid (SEQ ID NO. 2) sequences were determined, as
described in Example 1. The cDNA was assembled from the fragments. The
nucleotide and the deduced protein sequence are shown in SEQ ID NOs. 1
and 2, respectively. The protein is the same as the aspartic proteinase 2
(ASP2) described in EP 0 855 444 A by SmithKline Beecham
Pharmaceuticals (published July 29, 1998), and later reported by Sinha, et
al., Nature 402, 537-540 (December 1999) and Vassar, et al., Science 286,
735-741 (22 October 1999).

Pro-memapsin 2 is homologous to other human aspartic proteases.
Based on the alignments, Pro-memapsin 2 contains a pro region, an aspartic
protease region, and a trans-membrane region near the C-terminus. The C-
terminal domain is over 80 residues long. The active enzyme is memapsin 2
and its pro-enzyme is pro-memapsin 2.

Refolding Catalytically Active Enzyme
In order to determine the substrate specificity and to design
inhibitors, it is necessary to express catalytically active recombinant enzyme.
No other known proteases contain a transmembrane domain. The presence
of transmembrane domains makes the recombinant expression of these
proteins less predictable and more difficult. The transmembrane region often
needs to be removed so that secretion of the protein can take place.
However, the removal of the transmembrane region can often alter the
structure and/or function of the protein.

The starting assumption was that the region of memapsin 2 that is
homologous with other aspartic proteases would independently fold in the
absence of the transmembrane domain, and would retain protease activity in
the absence of the C-terminal transmembrane region. The transmembrane
region appears to serve as a membrane anchor. Since the active site is not in
the transmembrane region and activity does not require membrane
anchoring, memapsin 2 was expressed in E. coli in two different lengths,
both without the transmembrane region, and purified, as described in
Example 3. The procedures for the culture of transfected bacteria, induction
of synthesis of recombinant proteins and the recovery and washing of
inclusion bodies containing recombinant proteins are essentially as described
by Lin et al., (1994). Refolding was not a simple matter, however. Two
different refolding methods both produced satisfactory results. In both
methods, the protein was dissolved in a strong denaturing/reducing solution
such as 8 M urea/100 mM beta-mercaptoethanol. The rate at which the
protein was refolded, and in what solution, was critical to activity. In one
method, the protein is dissolved into 8 M urea/100 mM beta-
mercaptoethanol then rapidly diluted into 20 volumes of 20 mM Tris, pH
9.0, which is then slowly adjusted to pH 8 with 1 M HCl. The refolding
solution was then kept at 4°C for 24 to 48 hours before proceeding with
purification. In the second method, an equal volume of 20 mM Tris, 0.5 mM
oxidized/1.25 mM reduced glutathione, pH 9.0 is added to rapidly stirred
pro-memapsin 2 in 8 M urea/10 mM beta-mercaptoethanol. The process is
repeated three more times with 1 hour intervals. The resulting solution is
then dialyzed against sufficient volume of 20 mM Tris base so that the final
urea concentration is 0.4 M. The pH of the solution is then slowly adjusted to
8.0 with 1 M HCl.
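
The urea concentrations reached in the two refolding methods follow from simple dilution arithmetic. The following is a rough sketch under stated assumptions: the diluent contains no urea, volumes are additive, and "diluted into 20 volumes" means one volume of sample plus 20 volumes of buffer. The text does not say whether each "equal volume" in the second method refers to the current volume or to the starting volume, so both readings are computed. The function and variable names are illustrative.

```python
def dilute(conc_molar, sample_volumes, diluent_volumes):
    """Concentration after mixing a sample with solute-free diluent."""
    return conc_molar * sample_volumes / (sample_volumes + diluent_volumes)


# Method 1: one volume of 8 M urea solution into 20 volumes of buffer.
method1 = dilute(8.0, 1, 20)

# Method 2: four buffer additions in total (one, then three repeats).
# Reading A (assumption): each addition equals the current volume.
conc_a = 8.0
for _ in range(4):
    conc_a = dilute(conc_a, 1, 1)

# Reading B (assumption): each addition equals the starting volume.
conc_b = dilute(8.0, 1, 4)

print(f"method 1: {method1:.2f} M urea")
print(f"method 2, reading A: {conc_a:.2f} M urea before dialysis")
print(f"method 2, reading B: {conc_b:.2f} M urea before dialysis")
```

Under either reading of the second method the urea concentration before dialysis is still above 0.4 M, which is consistent with the subsequent dialysis step, and the first method lands close to the same final value by dilution alone.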

The refolded protein is then further purified by column
chromatography, based on molecular weight exclusion, and/or elution using
a salt gradient, and analyzed by SDS-PAGE analysis under reduced and non-
reduced conditions.
II. Substrate Specificity and Enzyme Kinetics of Memapsin 2

Substrate Specificity

The tissue distribution of the memapsin 2 was determined, as
described in Example 2. The presence of memapsin 2 (M2) in the brain
indicated that it might hydrolyze the β-amyloid precursor protein (APP). As
described below, detailed enzymatic and cellular studies demonstrated that
M2 fits all the criteria of the β-secretase.

The M2 three-dimensional structure was modeled as a type I integral
membrane protein. The model suggested that its globular protease unit can
hydrolyze a membrane anchored polypeptide at a distance range of 20-30
residues from the membrane surface. As a transmembrane protein of the
brain, APP is a potential substrate and its beta-secretase site, located about
28 residues from the plasma membrane surface, is within the range for M2
proteolysis.

A synthetic peptide derived from this site (SEVKM/DAEFR) (SEQ
ID NO:4) was hydrolyzed by M2pd (modified M2 containing amino acids
from Ala-8p to Ala326) at the beta-secretase site (marked by /). A second
peptide (SEVNL/DAEFR) (SEQ ID NO:5) derived from the APP beta-
secretase site and containing the 'Swedish mutation' (Mullan, M. et al.,
Nature Genet. 1:345-347 (1992)), known to elevate the level of Aβ
production in cells (Citron, M. et al., Nature 360:672-674 (1992)), was
hydrolyzed by M2pd with much higher catalytic efficiency. Both substrates
were optimally cleaved at pH 4.0. A peptide derived from the processing site
of presenilin 1 (SVNM/AEGD) (SEQ ID NO:6) was also cleaved by M2pd
with less efficient kinetic parameters. A peptide derived from the APP
gamma-secretase site (KGGVVIATVIVK) (SEQ ID NO:7) was not cleaved
by M2pd. Pepstatin A inhibited M2pd poorly (IC50 approximately
0.3 mM). The kinetic parameters indicate that both presenilin
1 (kcat, 0.67 s⁻¹; Km, 15.2 mM; kcat/Km, 43.8 s⁻¹ M⁻¹) and native APP peptides
(kcat/Km, 39.9 s⁻¹ M⁻¹) are not as good substrates as the Swedish APP peptide
(kcat, 2.45 s⁻¹; Km, 1 mM; kcat/Km, 2450 s⁻¹ M⁻¹).
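
The catalytic efficiencies quoted above follow from kcat divided by Km once Km is converted from mM to M. The following is a minimal sketch of that arithmetic, using only values reported in the text; the function name catalytic_efficiency is illustrative. The recomputed presenilin 1 value (about 44 s⁻¹ M⁻¹) differs slightly from the reported 43.8 s⁻¹ M⁻¹, presumably because the reported kcat and Km are rounded.

```python
def catalytic_efficiency(kcat_per_s, km_millimolar):
    """Return kcat/Km in s^-1 M^-1 from kcat (s^-1) and Km (mM)."""
    return kcat_per_s / (km_millimolar * 1e-3)


# Values reported in the text for the peptide substrates.
swedish = catalytic_efficiency(2.45, 1.0)
presenilin = catalytic_efficiency(0.67, 15.2)

# Ratio of the Swedish APP peptide to the native APP peptide, using the
# reported kcat/Km of 39.9 s^-1 M^-1 for the native peptide.
fold_over_native = swedish / 39.9

print(f"Swedish APP peptide: {swedish:.0f} s^-1 M^-1")
print(f"presenilin 1 peptide: {presenilin:.1f} s^-1 M^-1")
print(f"Swedish / native APP: {fold_over_native:.1f}-fold")
```

On these figures the Swedish APP peptide is hydrolyzed roughly 60 times more efficiently than the native APP peptide.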



To determine if M2 possesses an APP beta-secretase function in
mammalian cells, memapsin 2 was transiently expressed in HeLa cells (Lin,
X., et al., FASEB J. 7:1070-1080 (1993)), metabolically pulse-labeled with
³⁵S-Met, then immunoprecipitated with anti-APP antibodies for visualization
of APP-generated fragments after SDS-polyacrylamide electrophoresis and
imaging. SDS-PAGE patterns of immuno-precipitated APP betaN-fragment
(97 kD band) from the conditioned media (2 h) of pulse-chase experiments
showed that APP was cleaved by M2. Controls transfected with APP alone
and co-transfected with APP and M2 with Bafilomycin A1 added were
performed. SDS-PAGE patterns of APP betaC-fragment (12 kD) were
immunoprecipitated from the conditioned media of the same experiment as
discussed above. Controls transfected with APP alone; co-transfected with
APP and M2; co-transfected with APP and M2 with Bafilomycin A1;
transfections of Swedish APP; and co-transfections of Swedish APP and M2
were performed. SDS-PAGE gels were also run of immuno-precipitated M2
(70 kD): M2 transfected cells; untransfected HeLa cells after long time film
exposure; and endogenous M2 from HEK 293 cells. SDS-PAGE patterns of
APP fragments (100 kD alphaN-fragment and 95 kD betaN-fragment)
recovered from conditioned media after immuno-precipitation using
antibodies specific for different APP regions indicated that memapsin 2
cleaved APP.

Cells expressing both APP and M2 produced the 97 kD APP betaN-
fragment (from the N-terminus to the beta-secretase site) in the conditioned
media and the 12 kD betaC-fragment (from the beta-secretase site to the C-
terminus) in the cell lysate. Controls transfected with APP alone produced
little detectable betaN-fragment and no betaC-fragment. Bafilomycin A1,
which is known to raise the intra-vesicle pH of lysosomes/endosomes and
has been shown to inhibit APP cleavage by beta-secretase (Knops, J. et al., J.
Biol. Chem. 270:2419-2422 (1995)), abolished the production of both APP
fragments betaN- and betaC- in co-transfected cells. Cells transfected with
Swedish APP alone did not produce the betaC-fragment band in the cell
lysate but the co-transfection of Swedish APP and M2 did. This Swedish
betaC-fragment band is more intense than that of wild-type APP. A 97-kD
betaN-band is also seen in the conditioned media but is of about equal
intensity to that of the wild-type APP transfection.

These results indicate that M2 processes the beta-secretase site of
APP in acidic compartments such as the endosomes. To establish the
expression of the transfected M2 gene, the pulse-labeled cells were lysed and
immuno-precipitated by anti-M2 antibodies. A 70 kD M2 band was seen in
cells transfected with the M2 gene, which has the same mobility as the major
band from HEK 293 cells known to express beta-secretase (Citron, M., et al.,
Nature 360:672-674 (1992)). A very faint band of M2 is also seen, after a
long film exposure, in untransfected HeLa cells, indicating a very low level
of endogenous M2, which is insufficient to produce betaN- or betaC-
fragments without M2 transfection. Antibody alpha-beta1-17, which
specifically recognizes residues 1-17 in alpha-beta peptide, was used to
confirm the correct beta-secretase site cleavage. In cells transfected with
APP and M2, both beta N- and beta N-fragments are visible using an
antibody recognizing the N-terminal region of APP present in both
fragments. Antibody alpha-beta1-17 recognized the betaN-fragment produced by
endogenous beta-secretase in the untransfected cells. This antibody was,
however, unable to recognize the betaN-fragment known to be present in
cells co-transfected with APP and M2. These observations confirmed that
the betaN-fragment is the product of a beta-secretase site cut by M2, which
abolished the recognition epitope of alpha-beta1-17.

The processing of APP by M2 predicts the intracellular colocalization
of the two proteins. HeLa cells co-expressing APP and M2 were stained
with antibodies directed toward APP and M2 and visualized simultaneously
by CSLM using a 100x objective. Areas of colocalization appeared in
yellow.

Immunodetection observed by confocal microscopy of both APP and 
M2 revealed their colocalization in the superimposed scans. The distribution 
of both proteins is consistent with their residence in lysosomal/endosomal 
compartments.

In specificity studies, it was found that M2pd cleaved its pro peptide
(2 sites) and the protease portion (2 sites) during a 16 h incubation after
activation (Table 1). Besides the three peptides discussed above, M2pd also
cleaved oxidized bovine insulin B chain and a synthetic peptide Nch. Native
proteins were not cleaved by M2pd.

The data indicate that human M2 fulfills all the criteria of a beta-
secretase which cleaves the beta-amyloid precursor protein (APP): (a) M2
and APP are both membrane proteins present in human brain and co-localize
in mammalian cells, (b) M2 specifically cleaves the beta-secretase site of
synthetic peptides and of APP in cells, (c) M2 preferentially cleaves the beta-
secretase site from the Swedish over the wild-type APP, and (d) the acidic
pH optimum for M2 activity and bafilomycin A1 inhibition of APP
processing by M2 in the cells are consistent with the previous observations
that beta-secretase cleavage occurs in acidic vesicles (Knops, J., et al., J.
Biol. Chem. 270:2419-2422 (1995)). The spontaneous appearance of activity
of recombinant pro-M2 in an acidic solution suggests that, intracellularly,
this zymogen can by itself generate activity in an acidic vesicle like an
endosome.

II. Design and Synthesis of Inhibitors 

Design of Substrate Analogs for Memapsin 2
The five human aspartic proteases have homologous amino acid
sequences and have similar three-dimensional structures. There are two
aspartic residues in the active site and each residue is found within the
signature aspartic protease sequence motif Asp-Thr/Ser-Gly- (SEQ ID
NO:8). There are generally two homologous domains within an aspartic
protease and the substrate binding site is positioned between these two
domains, based on the three-dimensional structures. The substrate binding
sites of aspartic proteases generally recognize eight amino acid residues.
There are generally four residues on each side of the amide bond which is
cleaved by the aspartic protease.

Typically the side chains of each amino acid are involved in the
specificity of the substrate/aspartic protease interaction. The side chain of
each substrate residue is recognized by regions of the enzyme which are
collectively called sub-sites. The generally accepted nomenclature for the
protease sub-sites and their corresponding substrate residues is shown
below, where the double slash represents the position of bond cleavage.

Protease sub-sites    S4  S3  S2  S1     S1'  S2'  S3'  S4'
Substrate residues    P4  P3  P2  P1 //  P1'  P2'  P3'  P4'
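
The nomenclature above can be sketched in a few lines of Python. This is only an illustration: the function name label_positions and its arguments are hypothetical, and the example peptide is the OM99-2 octapeptide from the abstract, whose Leu-Ala bond occupies the cleavage position.

```python
def label_positions(residues, n_before_cut):
    """Map position labels (P4 ... P1, P1' ... P4') to residues.

    residues: residue names listed from the N-terminus.
    n_before_cut: number of residues on the N-terminal side of the
    cleaved bond (the double slash in the nomenclature).
    Hypothetical helper, written for illustration only.
    """
    if not 0 < n_before_cut < len(residues):
        raise ValueError("the cleavage site must lie inside the peptide")
    labels = {}
    for index, residue in enumerate(residues):
        if index < n_before_cut:
            # Unprimed positions count down toward the cleaved bond.
            labels[f"P{n_before_cut - index}"] = residue
        else:
            # Primed positions count up away from the cleaved bond.
            labels[f"P{index - n_before_cut + 1}'"] = residue
    return labels


# Octapeptide from the abstract; cleavage position between Leu and Ala.
octapeptide = ["Glu", "Val", "Asn", "Leu", "Ala", "Ala", "Glu", "Phe"]
positions = label_positions(octapeptide, 4)

for label, residue in positions.items():
    # Each Pn residue is recognized by the matching Sn sub-site.
    print(f"{label:>3} = {residue}, recognized by sub-site S{label[1:]}")
```

With four residues on each side of the cleaved bond, Leu falls at P1 and the first Ala at P1'.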

While there is a general motif for aspartic protease substrate
recognition, each protease has a very different substrate specificity and
breadth of specificity. Once the specificity of an aspartic protease is known,
inhibitors can be designed based on that specificity, which interact with the
aspartic protease in a way that prevents natural substrate from being
efficiently cleaved. Some aspartic proteases have specificities which can
accommodate many different residues in each of the sub-sites for successful
hydrolysis. Pepsin and cathepsin D have this type of specificity and are said
to have "broad" substrate specificity. When only a very few residues can be
recognized at a sub-site, such as in renin, the aspartic protease is said to have
a stringent or narrow specificity.

The information on the specificity of an aspartic protease can be used
to design specific inhibitors in which the preferred residues are placed at
specific sub-sites and the cleaved peptide bond is replaced by an analog of
the transition-state. These analogs are called transition state isosteres.
Aspartic proteases cleave amide bonds by a hydrolytic mechanism. This
reaction mechanism involves the attack by a hydroxide ion on the β-carbon
of the amino acid. Protonation must occur at the other atom attached to the
β-carbon through the bond that is to be cleaved. If the β-carbon is
insufficiently electrophilic or the atom attached to the bond to be cleaved is
insufficiently nucleophilic the bond will not be cleaved by a hydrolytic
mechanism. Analogs exist which do not mimic the transition state but which
are non-hydrolyzable, but transition state isosteres mimic the transition state
specifically and are non-hydrolyzable.

Transition state theory indicates that it is the transition state
intermediate of the reaction which the enzyme catalyzes for which the
enzyme has its highest affinity. It is the transition state structure, not the
ground state structure, of the substrate which will have the highest affinity
for its given enzyme. The transition state for the hydrolysis of an amide
bond is tetrahedral while the ground state structure is planar. A typical
transition-state isostere of aspartic protease is -CH(OH)-CH2-, as was first
discovered in pepstatin by Marciniszyn et al. (1976). The transition-state
analogue principles have been successfully applied to inhibitor drugs for
human immunodeficiency virus protease, an aspartic protease. Many of
these are currently in clinical use. Information on the structure, specificity,
and types of inhibitors can be found in Tang, Acid Proteases, Structure,
Function and Biology, Adv. in Exptl. Med. Biol. vol. 95 (Plenum Press, NY
1977); Kostka, Aspartic Proteinases and their Inhibitors (Walter de Gruyter,
Berlin 1985); Dunn, Structure and Functions of the Aspartic Proteinases,
Adv. in Exptl. Med. Biol. 306 (Plenum Press, NY 1991); Takahashi,
Aspartic Proteases, Structure, Function, Biology, Biomedical Implications,
Adv. in Exptl. Med. Biol. 362 (Plenum Press, NY 1995); and James,
Aspartic Proteinases, Retroviral and Cellular Enzymes, Adv. in Exptl. Med.
Biol. 436 (Plenum Press, NY 1998).

Substrate analog compositions are generally of the general formula
X-L4-P4-L3-P3-L2-P2-L1-P1-L0-P1'-L1'-P2'-L2'-P3'-L3'-P4'-L4'-Y. The
substrate analog compositions are analogs of small peptide molecules. Their
basic structure is derived from peptide sequences that were determined
through structure/function studies. It is understood that positions represented
by Px represent the substrate specificity position relative to the cleavage site
which is represented by an -L0-. The positions of the compositions
represented by Lx represent the linking regions between each substrate
specificity position, Px.

In a natural substrate for memapsin 2, a Px-Lx pair would represent a
single amino acid of the peptide which is to be cleaved. In the present
general formula, each Px part of the formula refers to the α-carbon and side
chain functional group of each would-be amino acid. Thus, the Px portion of
a Px-Lx pair for alanine represents HC-CH3. The general formula
representing the Px portion of the general composition is -R1CR3-.

In general R1 can be either CH3 (side chain of alanine), CH(CH3)2
(side chain of valine), CH2CH(CH3)2 (side chain of leucine), (CH3)CH(CH2
CH3) (side chain of isoleucine), CH2(Indole) (side chain of tryptophan),
CH2(Benzene) (side chain of phenylalanine), CH2CH2SCH3 (side chain of
methionine), H (side chain of glycine), CH2OH (side chain of serine),
CHOHCH3 (side chain of threonine), CH2(Phenol) (side chain of tyrosine),
CH2SH (side chain of cysteine), CH2CH2CONH2 (side chain of glutamine),
CH2CONH2 (side chain of asparagine), CH2CH2CH2CH2NH2 (side chain of
lysine), CH2CH2CH2NHC(NH)(NH2) (side chain of arginine),
CH2(Imidazole) (side chain of histidine), CH2COOH (side chain of aspartic
acid), CH2CH2COOH (side chain of glutamic acid), and functional natural
and non-natural derivatives or synthetic substitutions of these.

It is most preferred that R3 is a single H. In general, however, R3 can
be alkenyl, alkynyl, alkenyloxy, and alkynyloxy groups that allow binding to
memapsin 2. Preferably, alkenyl, alkynyl, alkenyloxy and alkynyloxy
groups have from 2 to 40 carbons, and more preferably from 2 to 20 carbons,
from 2 to 10 carbons, or from 2 to 3 carbons, and functional natural and
non-natural derivatives or synthetic substitutions of these.

The Lx portion of the Px-Lx pair represents the atoms linking the Px
regions together. In a natural substrate the Lx represents the β-carbon
attached to the amino portion of what would be the next amino acid in the
chain. Thus, Lx would be represented by -CO-NH-. The general formula for
Lx is represented by R2. In general R2 can be CO-HN (amide),
CH(OH)(CH2) (hydroxyethylene), CH(OH)CH(OH) (dihydroxyethylene),
CH(OH)CH2NH (hydroxyethylamine), PO(OH)CH2 (phosphinate), CH2NH
(reduced amide). It is understood that more than one L may be an isostere as
long as the substrate analog functions to inhibit aspartic protease function.

Ls which are not isosteres may either be an amide bond or a mimetic of
an amide bond that is non-hydrolyzable.

X and Y represent molecules which are not typically involved in the
recognition by the aspartic protease recognition site, but which do not
interfere with recognition. It is preferred that these molecules confer
resistance to the degradation of the substrate analog. Preferred examples
would be amino acids coupled to the substrate analog through a non-
hydrolyzable bond. Other preferred compounds would be capping agents.
Still other preferred compounds would be compounds which could be used
in the purification of the substrate analogs such as biotin.

As used herein, alkyl refers to substituted or unsubstituted straight,
branched or cyclic alkyl groups; and alkoxyl refers to substituted or
unsubstituted straight, branched or cyclic alkoxy. Preferably, alkyl and
alkoxy groups have from 1 to 40 carbons, and more preferably from 1 to 20
carbons, from 1 to 10 carbons, or from 1 to 3 carbons.

As used herein, alkenyl refers to substituted or unsubstituted
straight chain or branched alkenyl groups; alkynyl refers to substituted or
unsubstituted straight chain or branched alkynyl groups; alkenyloxy refers to
substituted or unsubstituted straight chain or branched alkenyloxy; and
alkynyloxy refers to substituted or unsubstituted straight chain or branched
alkynyloxy. Preferably, alkenyl, alkynyl, alkenyloxy and alkynyloxy groups
have from 2 to 40 carbons, and more preferably from 2 to 20 carbons, from 2
to 10 carbons, or from 2 to 3 carbons.

As used herein, alkaryl refers to an alkyl group that has an aryl
substituent; aralkyl refers to an aryl group that has an alkyl substituent;
heterocyclic-alkyl refers to a heterocyclic group with an alkyl substituent;
alkyl-heterocyclic refers to an alkyl group that has a heterocyclic substituent.

The substituents for alkyl, alkenyl, alkynyl, alkoxy, alkenyloxy, and
alkynyloxy groups can be halogen, cyano, amino, thio, carboxy, ester, ether,
thioether, carboxamide, hydroxy, or mercapto. Further, the groups can
optionally have one or more methylene groups replaced with a heteroatom,
such as O, NH or S.

A number of different substrates were tested and analyzed, and the
cleavage rules for memapsin 2 were determined. The results of the
substrates which were analyzed are presented in Table 1 and the rules
determined from these results are summarized below.

(1) The primary specificity site for a memapsin 2 substrate is subsite
position P1'. This means that the most important determinant for substrate
specificity in memapsin 2 is the amino acid at S1'. P1' must contain a small
side chain for memapsin 2 to recognize the substrate. Preferred
embodiments are substrate analogs where R1 of the P1' position is either H
(side chain of glycine), CH3 (side chain of alanine), CH2OH (side chain of
serine), or CH2COOH (side chain of aspartic acid). Embodiments that have an
R1 structurally smaller than CH3 (side chain of alanine) or CH2OH (side
chain of serine) are also preferred.

(2) There are no specific sequence requirements at positions P4, P3, P2, P1,
P2', P3', and P4'. Each site can accommodate any other amino acid residue in
singularity as long as rule number 3 is met.

(3) At least two of the remaining seven positions, P4, P3, P2, P1, P2', P3', and
P4', must have an R1 which is made up of a hydrophobic residue. It is
preferred that there are at least three hydrophobic residues in the remaining
seven positions, P4, P3, P2, P1, P2', P3', and P4'. Preferred R1 groups for the
positions that contain a hydrophobic group are CH3 (side chain of alanine),
CH(CH3)2 (side chain of valine), CH2CH(CH3)2 (side chain of leucine),
(CH3)CH(CH2CH3) (side chain of isoleucine), CH2(Indole) (side chain of
tryptophan), CH2(Benzene) (side chain of phenylalanine), CH2CH2SCH3
(side chain of methionine), and CH2(Phenol) (side chain of tyrosine). It is more
preferred that the hydrophobic group be a large hydrophobic group.
Preferred R1s which contain large hydrophobic groups are CH(CH3)2 (side
chain of valine), CH2CH(CH3)2 (side chain of leucine), (CH3)CH(CH2CH3)
(side chain of isoleucine), CH2(Indole) (side chain of tryptophan),
CH2(Benzene) (side chain of phenylalanine), CH2CH2SCH3 (side chain of
methionine), and CH2(Phenol) (side chain of tyrosine). It is most preferred that
positions with a hydrophobic R1 are CH(CH3)2 (side chain of valine),
CH2CH(CH3)2 (side chain of leucine), CH2(Benzene) (side chain of
phenylalanine), CH2CH2SCH3 (side chain of methionine), or CH2(Phenol)
(side chain of tyrosine).

(4) None of the eight positions, P4, P3, P2, P1, P1', P2', P3', and P4', may have
a proline side chain at its R1 position.

(5) Not all subsites must have a P represented in the analog. For example,
a substrate analog could have X-P2-L1-P1-L0-P1'-L1'-P2'-L2'-P3'-L3'-Y or it
could have X-L1-P1-L0-P1'-L1'-P2'-L2'-P3'-L3'-P4'-L4'-Y.

Preferred substrate analogs are analogs having the sequences
disclosed in Table 1, with the non-hydrolyzable analog between P1 and P1'.
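
As an illustration only, rules (1), (3) and (4) above can be written as a simple sequence filter. The sketch below is an assumption-laden reading of those rules, not a method taken from this specification: the function name follows_cleavage_rules is hypothetical, residues are given in one-letter code, and the test peptides other than the OM99-2 parent octapeptide (EVNLAAEF, from the abstract) are invented for the example.

```python
# Residues whose side chains are listed as preferred at P1' in rule (1):
# glycine, alanine, serine, aspartic acid.
SMALL_P1_PRIME = set("GASD")

# Residues whose side chains are listed as hydrophobic in rule (3):
# alanine, valine, leucine, isoleucine, tryptophan, phenylalanine,
# methionine, tyrosine.
HYDROPHOBIC = set("AVLIWFMY")


def follows_cleavage_rules(peptide, n_before_cut=4):
    """Check a one-letter peptide against rules (1), (3) and (4).

    n_before_cut is the number of residues on the N-terminal side of
    the cleavage position. Hypothetical helper for illustration.
    """
    if not 0 < n_before_cut < len(peptide):
        raise ValueError("the cleavage site must lie inside the peptide")
    # Rule (4): no proline at any position.
    if "P" in peptide:
        return False
    # Rule (1): P1' must carry a small side chain.
    if peptide[n_before_cut] not in SMALL_P1_PRIME:
        return False
    # Rule (3): at least two of the remaining positions are hydrophobic.
    others = peptide[:n_before_cut] + peptide[n_before_cut + 1:]
    return sum(residue in HYDROPHOBIC for residue in others) >= 2


print(follows_cleavage_rules("EVNLAAEF"))  # parent sequence of OM99-2
print(follows_cleavage_rules("EVNLPAEF"))  # invented: proline at P1'
print(follows_cleavage_rules("EENSAEEK"))  # invented: no hydrophobic residues
```

Rule (2) places no further constraint, and rule (5) is covered by allowing peptides shorter than eight residues. A filter of this kind only narrows a candidate list; it says nothing about how well a given analog binds.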
Combinatorial Chemistry to Make Inhibitors

Combinatorial chemistry includes but is not limited to all methods for
isolating molecules that are capable of binding either a small molecule or
another macromolecule. Proteins, oligonucleotides, and polysaccharides are
examples of macromolecules. For example, oligonucleotide molecules with
a given function, catalytic or ligand-binding, can be isolated from a complex
mixture of random oligonucleotides in what has been referred to as "in vitro
genetics" (Szostak, TIBS 19:89, 1992). One synthesizes a large pool of
molecules bearing random and defined sequences and subjects that complex
mixture, for example, approximately 10^15 individual sequences in 100 µg of
a 100 nucleotide RNA, to some selection and enrichment process. Through
repeated cycles of affinity chromatography and PCR amplification of the
molecules bound to the ligand on the column, Ellington and Szostak (1990)
estimated that 1 in 10^10 RNA molecules folded in such a way as to bind
small molecule dyes. DNA molecules with such ligand-binding behavior
have been isolated as well (Ellington and Szostak, 1992; Bock et al., 1992).
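
The pool size quoted above can be checked with a short calculation. The sketch below assumes an average mass of about 330 g/mol per ribonucleotide, a common approximation that is not stated in this document; all names in it are illustrative.

```python
AVOGADRO = 6.022e23          # molecules per mole
AVG_NT_MASS = 330.0          # g/mol per ribonucleotide (assumed average)


def molecules_in_pool(mass_g, length_nt):
    """Estimate the number of RNA molecules in a pool of a given mass."""
    molar_mass = length_nt * AVG_NT_MASS      # g/mol for the whole strand
    return mass_g / molar_mass * AVOGADRO


# 100 micrograms of a 100-nucleotide RNA.
pool = molecules_in_pool(100e-6, 100)
print(f"molecules in pool: {pool:.2e}")

# If roughly 1 molecule in 10^10 binds the ligand, the expected number
# of binders present in the starting pool is:
print(f"expected binders:  {pool / 1e10:.2e}")
```

Under that assumption the pool holds on the order of 10^15 molecules, so a binding frequency near 1 in 10^10 still leaves on the order of 10^5 candidate binders before any enrichment.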

Techniques aimed at similar goals exist for small organic molecules, 
proteins and peptides and other molecules known to those of skill in the art. 
Screening sets of molecules for a desired activity whether based on libraries 
of small synthetic molecules, oligonucleotides, proteins or peptides is 
broadly referred to as combinatorial chemistry. 

There are a number of methods for isolating proteins which either have de
novo activity or a modified activity. For example, phage display libraries
have been used for a number of years. A preferred method for isolating
proteins that have a given function is described by Roberts and Szostak
(Roberts R.W. and Szostak J.W., Proc. Natl. Acad. Sci. USA 94(23):12997-
302 (1997)). Another preferred method for combinatorial methods designed
to isolate peptides is described in Cohen et al. (Cohen B.A., et al., Proc. Natl.
Acad. Sci. USA 95(24):14272-7 (1998)). This method utilizes a modified
two-hybrid technology. Yeast two-hybrid systems are useful for the
detection and analysis of protein:protein interactions. The two-hybrid
system, initially described in the yeast Saccharomyces cerevisiae, is a
powerful molecular genetic technique for identifying new regulatory
molecules, specific to the protein of interest (Fields and Song, Nature
340:245-6 (1989)). Cohen et al. modified this technology so that novel
interactions between synthetic or engineered peptide sequences could be
identified which bind a molecule of choice. The benefit of this type of
technology is that the selection is done in an intracellular environment. The
method utilizes a library of peptide molecules that attach to an acidic
activation domain. A peptide of choice, for example an extracellular portion
of memapsin 2, is attached to a DNA binding domain of a transcriptional
activation protein, such as Gal 4. By performing the two-hybrid technique
on this type of system, molecules that bind the extracellular portion of
memapsin 2 can be identified.

Screening of Small Molecule Libraries
In addition to these more specialized techniques, methodology well
known to those of skill in the art, in combination with various small
molecule or combinatorial libraries, can be used to isolate and characterize
those molecules which bind to or interact with the desired target, either
memapsin 2 or its substrate. The relative binding affinity of these
compounds can be compared and optimum inhibitors identified using
competitive or non-competitive binding studies which are well known to
those of skill in the art. Preferred competitive inhibitors are non-
hydrolyzable analogs of memapsin 2. Another will cause allosteric
rearrangements which prevent memapsin 2 from functioning or folding
correctly.

Computer Assisted Rational Drug Design

Another way to isolate inhibitors is through rational design. This is
achieved through structural information and computer modeling. Computer
modeling technology allows visualization of the three-dimensional atomic
structure of a selected molecule and the rational design of new compounds
that will interact with the molecule. The three-dimensional construct
typically depends on data from x-ray crystallographic analyses or NMR
imaging of the selected molecule. The molecular dynamics require force
field data. The computer graphics systems enable prediction of how a new
compound will link to the target molecule and allow experimental
manipulation of the structures of the compound and target molecule to
perfect binding specificity. For example, using NMR spectroscopy, Inouye
and coworkers were able to obtain the structural information of N-terminal
truncated TSHK (transmembrane sensor histidine kinases) fragments which
retain the structure of the individual sub-domains of the catalytic site of a
TSHK. On the basis of the NMR study, they were able to identify potential
TSHK inhibitors (U.S. Patent No. 6,077,682 to Inouye). Another good
example is based on the three-dimensional structure of a
calcineurin/FKBP12/FK506 complex determined using high resolution X-ray
crystallography to obtain the shape and structure of both the calcineurin
active site binding pocket and the auxiliary FKBP12/FK506 binding pocket
(U.S. Patent No. 5,978,740 to Armistead). With this information in hand,
researchers can have a good understanding of the association of natural
ligands or substrates with the binding pockets of their corresponding
receptors or enzymes and are thus able to design and make effective
inhibitors.

Prediction of molecule-compound interaction when small changes are
made in one or both requires molecular mechanics software and
computationally intensive computers, usually coupled with user-friendly,
menu-driven interfaces between the molecular design program and the user.
Examples of molecular modeling systems are the CHARMm and QUANTA
programs, Polygen Corporation, Waltham, MA. CHARMm performs the
energy minimization and molecular dynamics functions. QUANTA
performs the construction, graphic modeling and analysis of molecular
structure. QUANTA allows interactive construction, modification,
visualization, and analysis of the behavior of molecules with each other.

A number of articles review computer modeling of drugs interactive
with specific proteins, such as Rotivinen, et al., Pharmaceutica
Fennica 97, 159-166; Ripka, New Scientist 54-57 (June 16, 1988); McKinaly
and Rossmann, 1989 Annu. Rev. Pharmacol. Toxicol. 29, 111-122; Perry
and Davies, QSAR: Quantitative Structure-Activity Relationships in Drug
Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis and Dean, 1989 Proc.
R. Soc. Lond. 236, 125-140 and 141-162; and, with respect to a model
enzyme for nucleic acid components, Askew, et al., 1989 J. Am. Chem. Soc.
111, 1082-1090. Other computer programs that screen and graphically
depict chemicals are available from companies such as BioDesign, Inc.,
Pasadena, CA, Allelix, Inc., Mississauga, Ontario, Canada, and Hypercube,
Inc., Cambridge, Ontario.

Although described above with reference to design and generation of 
compounds which could alter binding, one could also screen libraries of 
known compounds, including natural products or synthetic chemicals, and 
biologically active materials, including proteins, for compounds which alter 
substrate binding or enzymatic activity. 

Screening of Libraries

Design of substrate analogs and rational drug design are based on
knowledge of the active site and target, and utilize computer software
programs that create detailed structures of the enzyme and its substrate, as
well as ways they interact, alone or in the presence of inhibitor. These
techniques are significantly enhanced with x-ray crystallographic data in
hand. Inhibitors can also be obtained by screening libraries of existing
compounds for those which inhibit the catalytically active enzyme. In
contrast to reports in the literature relating to memapsin 2, the enzyme
described herein has activity analogous to the naturally produced enzyme,
providing a means for identifying compounds which inhibit the endogenous
activity. These potential inhibitors are typically identified using high
throughput assays, in which enzyme, substrate (preferably a chromogenic
substrate) and potential inhibitor (usually screened across a range of
concentrations) are mixed and the extent of cleavage of substrate determined.
Potentially useful inhibitors are those which decrease the amount of
cleavage.
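
One way such assay readings might be reduced to numbers is sketched below. It is a minimal illustration under stated assumptions: the readings are invented, the cleavage signal is taken to be proportional to the amount of substrate cleaved, and the function names percent_inhibition and estimate_ic50 are hypothetical rather than part of any assay described here.

```python
import math


def percent_inhibition(cleavage, cleavage_no_inhibitor):
    """Percent decrease in cleavage relative to the uninhibited control."""
    if cleavage_no_inhibitor <= 0:
        raise ValueError("control cleavage signal must be positive")
    return 100.0 * (1.0 - cleavage / cleavage_no_inhibitor)


def estimate_ic50(concentrations, inhibitions):
    """Interpolate, on a log concentration scale, the concentration that
    gives 50% inhibition. Returns None if 50% is not bracketed."""
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo < 50.0 <= i_hi:
            fraction = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10.0 ** log_c
    return None


# Invented example: cleavage signal (arbitrary units) at each inhibitor
# concentration (molar), with an uninhibited control of 1000 units.
control = 1000.0
readings = {1e-9: 800.0, 1e-8: 600.0, 1e-7: 200.0}

inhibitions = {c: percent_inhibition(s, control) for c, s in readings.items()}
ic50 = estimate_ic50(list(inhibitions), list(inhibitions.values()))

for conc, value in inhibitions.items():
    print(f"{conc:.0e} M: {value:.0f}% inhibition")
print("interpolated IC50 (M):", ic50)
```

Compounds that reach high inhibition at low concentration would be the ones carried forward. The interpolated value depends on the substrate concentration and assay conditions, so it is not the same quantity as an inhibition constant.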

III. Methods of diagnosis and treatment

Inhibitors can be used in the diagnosis and treatment and/or
prevention of Alzheimer's disease and conditions associated therewith, such
as elevated levels of the forty-two amino acid peptide cleavage product, and
the accumulation of the peptide in amyloid plaques.

Diagnostic Uses

The substrate analogs can be used as reagents for specifically binding
to memapsin 2 or memapsin 2 analogs and for aiding in memapsin 2
isolation and purification or characterization, as described in the examples.
The inhibitors and purified recombinant enzyme can be used in screens for
those individuals more genetically prone to develop Alzheimer's disease.

Therapeutic Uses 

Recombinant human memapsin 2 cleaves a substrate with the
sequence LVNM/AEGD (SEQ ID NO:9). This sequence is the in vivo
processing site sequence of human presenilins. Both presenilin 1 and
presenilin 2 are integral membrane proteins. They are processed by protease
cleavage, which removes the N terminal sequence from the unprocessed
form. Once processed, presenilin forms a two-chain heterodimer (Capell et
al., J. Biol. Chem. 273, 3205 (1998); Thinakaran et al., Neurobiol. Dis. 4,
438 (1998); Yu et al., Neurosci. Lett. 2:254(3):125-8 (1998)), which is stable
relative to the unprocessed presenilins. Unprocessed presenilins are quickly
degraded (Thinakaran et al., J. Biol. Chem. 272, 28415 (1997); Steiner et al.,
J. Biol. Chem. 273, 32322 (1998)). It is known that presenilin controls the
in vivo activity of beta-secretase, which in turn cleaves the amyloid precursor
protein (APP) leading to the formation of alpha-beta42. The accumulation of
alpha-beta42 in the brain cells is known to be a major cause of Alzheimer's
disease (for review, see Selkoe, 1998). The activity of presenilin therefore
enhances the progression of Alzheimer's disease. This is supported by the
observation that in the absence of the presenilin gene, the production of alpha-
beta42 peptide is lowered (De Strooper et al., Nature 391, 387 (1998)).
Since unprocessed presenilin is degraded quickly, the processed,
heterodimeric presenilin must be responsible for the accumulation of alpha-
beta42 leading to Alzheimer's disease. The processing of presenilin by
memapsin 2 would enhance the production of alpha-beta42 and therefore
further the progress of Alzheimer's disease. Therefore a memapsin 2
inhibitor that crosses the blood brain barrier can be used to decrease the
likelihood of developing or slow the progression of Alzheimer's disease
which is mediated by deposition of alpha-beta42. Since memapsin 2 cleaves
APP at the beta cleavage site, prevention of APP cleavage at the beta
cleavage site will prevent the build up of alpha-beta42.
Vaccines 

The catalytically active memapsin 2 or fragments thereof including
the active site defined by the presence of two catalytic aspartic residues and
substrate binding cleft can be used to induce an immune response to the
memapsin 2. The memapsin 2 is administered in an amount effective to
elicit blocking antibodies, i.e., antibodies which prevent cleavage of the
naturally occurring substrate of memapsin 2 in the brain. An unmodified
vaccine may be useful in the prevention and treatment of Alzheimer's
disease. The response to the vaccine may be influenced by its composition,
such as inclusion of an adjuvant, viral proteins from production of the
recombinant enzyme, and/or mode of administration (amount, site of
administration, frequency of administration, etc.). Since it is clear that the
enzyme must be properly folded in order to be active, antibody should be
elicited that is active against the endogenous memapsin 2. Antibodies that
are effective against the endogenous enzyme are less likely to be produced
against the enzyme that is not properly refolded.

Pharmaceutically Acceptable Carriers

The inhibitors will typically be administered orally or by injection.
Oral administration is preferred. Alternatively, other formulations can be
used for delivery by pulmonary, mucosal or transdermal routes. The
inhibitor will usually be administered in combination with a
pharmaceutically acceptable carrier. Pharmaceutical carriers are known to
those skilled in the art. The appropriate carrier will typically be selected
based on the mode of administration. Pharmaceutical compositions may also
include one or more active ingredients such as antimicrobial agents,
antiinflammatory agents, and analgesics.

Preparations for parenteral administration or administration by
injection include sterile aqueous or non-aqueous solutions, suspensions, and
emulsions. Examples of non-aqueous solvents are propylene glycol,
polyethylene glycol, vegetable oils such as olive oil, and injectable organic
esters such as ethyl oleate. Aqueous carriers include water,
alcoholic/aqueous solutions, emulsions or suspensions, including saline and
buffered media. Preferred parenteral vehicles include sodium chloride
solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's,
or fixed oils. Intravenous vehicles include fluid and nutrient replenishers,
and electrolyte replenishers (such as those based on Ringer's dextrose).

Formulations for topical (including application to a mucosal surface,
including the mouth, pulmonary, nasal, vaginal or rectal) administration may
include ointments, lotions, creams, gels, drops, suppositories, sprays, liquids
and powders. Formulations for these applications are known. For example, a
number of pulmonary formulations have been developed, typically using spray
drying to formulate a powder having particles with an aerodynamic diameter
of between one and three microns, consisting of drug or drug in combination
with polymer and/or surfactant.

Compositions for oral administration include powders or granules,
suspensions or solutions in water or non-aqueous media, capsules, sachets, or
tablets. Thickeners, flavorings, diluents, emulsifiers, dispersing aids or binders
may be desirable.

Peptides as described herein can also be administered as a
pharmaceutically acceptable acid- or base-addition salt, formed by reaction
with inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric
acid, nitric acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and
organic acids such as formic acid, acetic acid, propionic acid, glycolic acid,
lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic
acid, and fumaric acid, or by reaction with an inorganic base such as sodium
hydroxide, ammonium hydroxide, potassium hydroxide, and organic bases
such as mono-, di-, trialkyl and aryl amines and substituted ethanolamines.

Dosages 

Dosing is dependent on severity and responsiveness of the condition to
be treated, but will normally be one or more doses per day, with course of
treatment lasting from several days to several months or until the attending
physician determines no further benefit will be obtained. Persons of ordinary
skill can determine optimum dosages, dosing methodologies and repetition
rates.

The dosage ranges are those large enough to produce the desired 
effect in which the symptoms of the memapsin 2 mediated disorder are 
alleviated (typically characterized by a decrease in size and/or number of 
amyloid plaque, or by a failure to increase in size or quantity), or in which 
cleavage of the alpha-beta42 peptide is decreased. The dosage can be
adjusted by the individual physician in the event of any counterindications.

The present invention will be further understood by reference to the
following non-limiting examples. 
Example 1. Cloning of memapsin 2. 

1. Cloning and nucleotide sequence of pro-memapsin 2.

New sequences homologous to human aspartic proteases were found
in the following entries in the EST IMAGE database: AA136368 pregnant
uterus ATCC 947471, AA207232 neurepithelium ATCC 214526, and
R55398 human breast ATCC 392689. The corresponding bacterial strains,
#947471, #214526, and #392689, containing the EST sequences were
obtained from the ATCC (Rockville, MD). The sequencing of these clones
obtained from ATCC confirmed that they contained sequences not identical
to known human aspartic proteases. The completed sequences of these
clones assembled into about 80% of prepro-M2 cDNA. Full length cDNAs
of these clones were obtained using the following methods.

The Human Pancreas Marathon-Ready cDNA (Clontech), which is
double-strand cDNA obtained by reverse-transcription, primer addition, and
second strand synthesis of mRNA from human tissues, was used as
template for PCR amplification. An adapter primer (AP1) and a nested
adapter primer (AP2) were used for 5'- and 3'-RACE PCR. For PCR of the
5'-region of the memapsin 2 cDNA, primers AP1 and NHASPR1 were used.
Primers for the 3'-end of the cDNA are NHASPF2 and AP1. The middle of
the cDNA was amplified by primers NHASPF1 and NHASPR2. The
sequence for the primers is as follows:

NHASPF1: GGTAAGCATCCCCCATGGCCCCAACGTC (SEQ ID NO:10),
NHASPR1: GACGTTGGGGCCATGGGGGATGCTTACC (SEQ ID NO:11),
NHASPF2: ACGTTGTCTTTGATCGGGCCCGAAAACGAATTGG (SEQ ID NO:12),
NHASPR2: CCAATTCGTTTTCGGGCCCGATCAAAGACAACG (SEQ ID NO:13),
AP1: CCATCCTAATACGACTCACTATAGGGC (SEQ ID NO:14), and
AP2: ACTCACTATAGGGCTCGAGCGGC (SEQ ID NO:15).
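
As a quick consistency check on the gene-specific primers listed above, the following minimal Python sketch tests whether each forward/reverse pair is mutually reverse-complementary, as would be expected for primers that anneal to the same stretch of the cDNA on opposite strands. The sequences are transcribed from SEQ ID NO:10-13 as printed here; the function names are illustrative only and are not part of the original disclosure.

```python
# Gene-specific primers transcribed 5'->3' from the text (SEQ ID NO:10-13).
PRIMERS = {
    "NHASPF1": "GGTAAGCATCCCCCATGGCCCCAACGTC",
    "NHASPR1": "GACGTTGGGGCCATGGGGGATGCTTACC",
    "NHASPF2": "ACGTTGTCTTTGATCGGGCCCGAAAACGAATTGG",
    "NHASPR2": "CCAATTCGTTTTCGGGCCCGATCAAAGACAACG",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def same_site_opposite_strand(fwd: str, rev: str) -> bool:
    """True if one primer is contained in the reverse complement of the other."""
    rc = reverse_complement(fwd)
    return rev in rc or rc in rev


if __name__ == "__main__":
    for f, r in (("NHASPF1", "NHASPR1"), ("NHASPF2", "NHASPR2")):
        print(f, r, same_site_opposite_strand(PRIMERS[f], PRIMERS[r]))
```

Under this check NHASPR1 is the exact reverse complement of NHASPF1, and NHASPR2 matches the reverse complement of NHASPF2 less one terminal base, which is consistent with the pairing of primers described for the 5'-, middle and 3'-regions.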

Memapsin 2 was also cloned from a human pancreas library (Quick-
Screen Human cDNA Library Panel) contained in lambda-gt10 and lambda-
gt11 vectors. The primers from the vectors, GT10FWD, GT10REV,
GT11FWD, and GT11REV, were used as outside primers. The sequence of
the primers used was:

GT10FWD: CTTTTGAGCAAGTTCAGCCTGGTTAA (SEQ ID NO:16),
GT10REV: GAGGTGGCTTATGAGTATTTCTTCCAGGGTA (SEQ ID NO:17),
GT11FWD: TGGCGACGACTCCTGGAGCCCG (SEQ ID NO:18),
GT11REV: TGACACCAGACCAACTGGTAATGG (SEQ ID NO:19).

In addition, memapsin 2 cDNA was amplified directly from the
human pancreatic lambda-gt10 and lambda-gt11 libraries. The sequence of
the primers was:

PASPN1: catatgGCGGCAGTGCTGCCTGCCCAC (SEQ ID NO:20) and
NHASPC1: ggatccTCACTTCAGCAGGGAGATGTCATCAGCAAAGT (SEQ ID NO:21).

The amplified memapsin 2 fragments were cloned into an
intermediate PCR vector (Invitrogen) and sequenced.

The assembled cDNA from the fragments, the nucleotide and the
deduced protein sequence are shown in SEQ ID NO 1 and SEQ ID NO 2.
Pro-memapsin 2 is homologous to other human aspartic proteases.
Based on the alignments, pro-memapsin 2 contains a pro region, an aspartic
protease region, and a trans-membrane region near the C-terminus. The
active enzyme is memapsin 2 and its pro-enzyme is pro-memapsin 2.

Example 2. Distribution of memapsin 2 in human tissues.

Multiple tissue cDNA panels from Clontech were used as templates
for PCR amplification of a 0.82 kb fragment of memapsin 2 cDNA. The
primers used for memapsin 2 were NHASPF1 and NHASPR2. Tissues that
contain memapsin 2 or fragments of memapsin 2 yielded amplified PCR
products. The amount of amplified product indicated that memapsin 2 is
present in the following organs from most abundant to least abundant:
pancreas, brain, lung, kidney, liver, placenta, and heart. Memapsin 2 is also
present in spleen, prostate, testis, ovary, small intestine, and colon cells.

Example 3. Expression of pro-memapsin 2 cDNA in E. coli, refolding
and purification of pro-memapsin 2.

The pro-memapsin 2 was PCR amplified and cloned into the BamHI
site of a pET11a vector. The resulting vector expresses pro-memapsin 2
having a sequence from Ala-8p to Ala 326. Figure 1 shows the construction
of two expression vectors, pET11-memapsin 2-T1 (hereafter T1) and pET11-
memapsin 2-T2 (hereafter T2). In both vectors, the N-terminal 15 residues
of the expressed recombinant proteins are derived from the expression
vector. Pro-memapsin 2 residues start at residue Ala-16. The two
recombinant pro-memapsin 2s have different C-terminal lengths. Clone T1
ends at Thr-454 and clone T2 ends at Ala-419. The T1 construct contains a
C-terminal extension from the T2 construct but does not express any of the
predicted transmembrane domain.

Expression of recombinant proteins and recovery of inclusion bodies

The T1 and T2 expression vectors were separately transfected into E.
coli strain BL21(DE3). The procedures for the culture of transfected
bacteria, induction for synthesis of recombinant proteins and the recovery
and washing of inclusion bodies containing recombinant proteins are
essentially as previously described (Lin et al., 1994).

Three different refolding methods have produced satisfactory results.

(i) The rapid dilution method.

Pro-memapsin 2 in 8 M urea/100 mM beta-mercaptoethanol with
OD280nm = 5 was rapidly diluted into 20 volumes of 20 mM Tris, pH 9.0.
The solution was slowly adjusted to pH 8 with 1 M HCl. The refolding
solution was then kept at 4°C for 24 to 48 hours before proceeding with
purification.

(ii) The reverse dialysis method 

An equal volume of 20 mM Tris, 0.5 mM oxidized/1.25 mM reduced
glutathione, pH 9.0 is added to rapidly stirred pro-memapsin 2 in 8 M
urea/10 mM beta-mercaptoethanol with OD280nm = 5. The process is
repeated three more times with 1 hour intervals. The resulting solution is
then dialyzed against a sufficient volume of 20 mM Tris base so that the final
urea concentration is 0.4 M. The pH of the solution is then slowly adjusted to
8.0 with 1 M HCl.
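
The urea concentrations reached in methods (i) and (ii) follow from simple dilution arithmetic, sketched below in Python. Two readings are assumed here and are not stated in the text: that "diluted into 20 volumes" means one volume of sample added to 20 volumes of buffer, and that each "equal volume" addition in the reverse dialysis method doubles the current volume. The function names are illustrative.

```python
def dilute(conc: float, sample_volumes: float, diluent_volumes: float) -> float:
    """Concentration after mixing sample with diluent that lacks the solute."""
    return conc * sample_volumes / (sample_volumes + diluent_volumes)


def stepwise_doubling(start_conc: float, additions: int) -> list:
    """Concentration after each equal-volume addition (assumed to double the volume)."""
    history = [start_conc]
    for _ in range(additions):
        history.append(dilute(history[-1], 1.0, 1.0))
    return history


if __name__ == "__main__":
    # (i) rapid dilution: 8 M urea, 1 volume into 20 volumes of buffer
    print("rapid dilution, urea (M):", round(dilute(8.0, 1.0, 20.0), 2))
    # (ii) reverse dialysis: one addition plus three repeats = four additions
    print("reverse dialysis, urea (M):", stepwise_doubling(8.0, 4))
```

On these assumptions the rapid dilution leaves roughly 0.38 M urea, and four equal-volume additions bring 8 M urea down to 0.5 M before the final dialysis step to 0.4 M.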

(iii) The preferred method for refolding.

Inclusion bodies are dissolved in 8 M urea, 0.1 M Tris, 1 mM
Glycine, 1 mM EDTA, 100 mM beta-mercaptoethanol, pH 10.0. The OD280
of the inclusion bodies is adjusted to 5.0 with the 8 M urea solution without
beta-mercaptoethanol. The final solution contains the following reducing
reagents:
10 mM beta-mercaptoethanol, 10 mM DTT (dithiothreitol), 1 mM reduced
glutathione, and 0.1 M oxidized glutathione. The final pH of the solution is
10.0.

The above solution is rapidly diluted into 20 volumes of 20 mM Tris
base, the pH is adjusted to 9.0, and the resulting solution is kept at 4 °C for
16 hr. The solution is equilibrated to room temperature in 6 hr, and the pH is
adjusted to 8.5. The solution is returned to 4 °C again for 18 hr.

The solution is again equilibrated to room temperature in 6 hr, and
the pH is adjusted to 8.0. The solution is returned to 4 °C again for 4 to 7
days.

The refolding procedures are critical to obtain an enzymically active 
preparation which can be used for studies of subsite specificity of M2, to
analyze inhibition potency of M2 inhibitors, to screen for inhibitors using
either random structural libraries or existing collections of compound
libraries, to produce crystals for crystallography studies of M2 structures,
and to produce monoclonal or polyclonal antibodies of M2.

Purification of recombinant pro-memapsin 2-T2

The refolded material is concentrated by ultrafiltration, and separated
on a SEPHACRYL™ S-300 column equilibrated with 20 mM Tris-HCl, 0.4
M urea, pH 8.0. The refolded peak (second peak) from the S-300 column
can be further purified with a FPLC RESOURCE-Q™ column, which is
equilibrated with 20 mM Tris-HCl, 0.4 M urea, pH 8.0. The enzyme is eluted
from the column with a linear gradient of NaCl. The refolded peak from S-
300 can also be activated before further purification. For activation, the
fractions are mixed with equal volume 0.2 M Sodium Acetate, 70% glycerol,
pH 4.0. The mixture is incubated at 22 °C for 18 hr, and then dialyzed twice
against 20 volumes of 20 mM Bis-Tris, 0.4 M urea, pH 6.0. The dialyzed
materials are then further purified on a FPLC RESOURCE-Q™ column
equilibrated with 20 mM Bis-Tris, 0.4 M urea, pH 6.0. The enzyme is eluted with
a linear gradient of NaCl.

SDS-PAGE analysis of the S-300 fractions under reduced and non-
reduced conditions indicated that pro-memapsin 2 first elutes as a very high
molecular weight band (greater than about 42 kD) under non-reduced
conditions. This indicates that the protein is not folded properly in these
fractions, due to disulfide cross linking of proteins. Subsequent fractions
contain a protein of predicted pro-memapsin 2-T2 size (about 42 kDa). The
pro-enzyme obtained in these fractions is also proteolytically active for auto-
catalyzed activation. These fractions were pooled and subjected to
chromatography on the FPLC RESOURCE™ column eluted with a linear
gradient of NaCl. Some fractions were analyzed using SDS-PAGE under
non-reducing conditions. The analysis showed that fractions 6 and 7
contained most of the active proteins, which was consistent with the first
FPLC peak containing the active protein. The main peak was coupled to a
shoulder peak, and was present with repeated purification with the same
RESOURCE™ Q column. The main and shoulder peaks were identified as
active pro-memapsin 2 that exist in different conformations under these
conditions.

Example 4. Proteolytic activity and cleavage-site preferences of
recombinant memapsin 2.

The amino acid sequence around the proteolytic cleavage sites was
determined in order to establish the specificity of memapsin 2. Recombinant
pro-memapsin 2-T1 was incubated in 0.1 M sodium acetate, pH 4.0, for 16
hours at room temperature in order to create autocatalyzed cleavages. The
products were analyzed using SDS-polyacrylamide gel electrophoresis.
Several bands which corresponded to molecular weights smaller than that of
pro-memapsin 2 were observed. The electrophoretic bands were trans-blotted
onto a PVDF membrane. Four bands were chosen and subjected to N-
terminal sequence determination in a Protein Sequencer. The N-terminal
sequence of these bands established the positions of proteolytic cleavage
sites on pro-memapsin 2.

In addition, the oxidized B-chain of bovine insulin and two different
synthetic peptides were used as substrates for memapsin 2 to determine the
extent of other hydrolysis sites. These reactions were carried out by auto-
activated pro-memapsin 2 in 0.1 M sodium acetate, pH 4.0, which was then
incubated with the peptides. The hydrolytic products were subjected to
HPLC on a reversed phase C-18 column and the eluent peaks were subjected
to electrospray mass spectrometry for the determination of the molecular
weight of the fragments. Two hydrolytic sites were identified on oxidized
insulin B-chain (Table 1). Three hydrolytic sites were identified from
peptide NCII-gamma. A single cleavage site was observed in synthetic
peptide PS1-gamma, whose sequence (LVNMAEGD) (SEQ ID NO:9) is
derived from the beta-processing site of human presenilin 1 (Table 1).
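
To illustrate how fragment molecular weights from mass spectrometry can locate a hydrolysis site, the Python sketch below enumerates every possible single cleavage of the PS1-gamma sequence LVNMAEGD and computes the neutral monoisotopic mass of each pair of products. It is a generic illustration rather than the analysis actually performed: the residue masses are standard monoisotopic values supplied here as an assumption, free N- and C-termini are assumed, and the function names are hypothetical. The observed cleavage site itself is reported in Table 1 and is not asserted by this sketch.

```python
# Standard monoisotopic residue masses (Da) for the residues in LVNMAEGD.
RESIDUE_MASS = {
    "L": 113.08406, "V": 99.06841, "N": 114.04293, "M": 131.04049,
    "A": 71.03711, "E": 129.04259, "G": 57.02146, "D": 115.02694,
}
WATER = 18.01056  # added once per free peptide (H on N-terminus, OH on C-terminus)


def peptide_mass(seq: str) -> float:
    """Neutral monoisotopic mass of an unmodified linear peptide."""
    return sum(RESIDUE_MASS[r] for r in seq) + WATER


def single_cleavage_products(seq: str) -> list:
    """All (N-fragment, C-fragment, mass_N, mass_C) tuples for one hydrolysis."""
    return [
        (seq[:i], seq[i:], peptide_mass(seq[:i]), peptide_mass(seq[i:]))
        for i in range(1, len(seq))
    ]


if __name__ == "__main__":
    for n_frag, c_frag, m_n, m_c in single_cleavage_products("LVNMAEGD"):
        print(f"{n_frag:>7} | {c_frag:<7}  {m_n:9.4f}  {m_c:9.4f}")
```

Because hydrolysis adds one water, the two product masses for any site sum to the parent mass plus 18.01056 Da, so a matched pair of observed masses identifies the bond that was cut.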
Table 1: Substrate Specificity of Memapsin 2

Example 5. Activation of pro-memapsin 2 and enzyme kinetics.

Incubation in 0.1 M sodium acetate, pH 4.0, for 16 h at 22 °C auto-
catalytically converted pro-M2pd to M2pd. For initial hydrolysis tests, two
synthetic peptides were separately incubated with pro-M2pd in 0.1 M Na
acetate, pH 4.0 for different periods ranging from 2 to 18 h. The incubated
samples were subjected to LC/MS for the identification of the hydrolytic
products. For kinetic studies, the identified HPLC (Beckman System Gold)
product peaks were integrated for quantitation. The Km and kcat values for
presenilin 1 and Swedish APP peptides (Table 1) were measured by steady-
state kinetics. The individual Km and kcat values for APP peptide could not
be measured accurately by standard methods, so its kcat/Km value was
measured by competitive hydrolysis of mixed substrates against presenilin 1
peptide (Fersht, A. "Enzyme Structure and Mechanism", 2nd Ed., W.H.
Freeman and Company, New York (1985)).
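
The competitive hydrolysis approach mentioned above rests on a standard relation for two substrates A and B competing for the same enzyme: the ratio of their kcat/Km values equals ln([A]t/[A]0) / ln([B]t/[B]0), independent of enzyme concentration and time. The Python sketch below applies that relation to the fraction of each substrate remaining. It is a minimal illustration under that assumption; the function name and the example fractions are hypothetical and are not data from this disclosure.

```python
import math


def relative_kcat_over_km(frac_a_remaining: float, frac_b_remaining: float) -> float:
    """(kcat/Km)_A / (kcat/Km)_B from fractions of A and B left in one mixed reaction.

    Each fraction is [S]t/[S]0 measured at the same time point and must lie
    strictly between 0 and 1.
    """
    for frac in (frac_a_remaining, frac_b_remaining):
        if not 0.0 < frac < 1.0:
            raise ValueError("fractions remaining must be between 0 and 1")
    return math.log(frac_a_remaining) / math.log(frac_b_remaining)


if __name__ == "__main__":
    # Hypothetical example: 84% of A and 50% of B remain at the same time point.
    print(round(relative_kcat_over_km(0.84, 0.50), 3))
```

Multiplying the resulting ratio by the independently measured kcat/Km of the reference substrate (here, the presenilin 1 peptide) gives the kcat/Km of the substrate whose individual constants could not be measured.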

The results are shown in Figures 2A and 2B. The conversion of pro-
M2pd at pH 4.0 to smaller fragments was shown by SDS-polyacrylamide
electrophoresis. The difference in migration between pro-M2pd and
converted enzyme is evident in a mixture of the two. Figure 2A is a graph of
the initial rate of hydrolysis of synthetic peptide swAPP (see Table 1) by
M2pd at different pH. Figure 2B is a graph of the relative kcat/Km values for
steady-state kinetic of hydrolysis of peptide substrates by M2pd.
Example 6. Expression in Mammalian cells. 
Methods 

PM2 cDNA was cloned into the EcoRV site of vector pSecTag A
(Invitrogen). Human APP cDNA was PCR amplified from human placenta
lambda-gt11 library (Clontech) and cloned into the NheI and XbaI sites of pSecTag
A. The procedure for transfection into HeLa cells and vaccinia virus
infection for T7-based expression are essentially the same as described by
Lin, X., FASEB J. 7:1070-1080 (1993).

Transfected cells were metabolically labeled with 200 microCi 35S
methionine and cysteine (TransLabel; ICN) in 0.5 ml of serum-
free/methionine-free media for 30 min, rinsed with 1 ml media, and replaced
with 2 ml DMEM/10% FCS. In order to block vesicle acidification,
Bafilomycin A1 was included in the media (Perez, R.G., et al., J. Biol. Chem.
271:9100-9107 (1996)). At different time points (chase), media was
removed and the cells were harvested and lysed in 50 mM Tris, 0.3 M NaCl,
5 mM EDTA, 1% Triton X-100, pH 7.4, containing 10 mM iodoacetamide,
10 microM TPCK, 10 microM TLCK, and 2 microg/ml leupeptin. The supernatant
(14,000 x g) of cell lysates and media were immunoadsorbed onto antibody
bound to protein G sepharose (Sigma). Anti-APP N-terminal domain
antibody (Chemicon) was used to recover the betaN-fragment of APP and
anti-alpha-beta1-17 antibody (Chemicon, recognizing the N-terminal 17
residues of alpha-beta) was used to recover the 12 kDa betaC-fragment. The
former antibody recognized only denatured protein, so media was first
incubated in 2 mM dithiothreitol, 0.1% SDS at 55 °C for 30 min before
immunoabsorption. Samples were cooled and diluted with an equal volume
of cell lysis buffer before addition of anti-APP N-terminal domain
(Chemicon). Beads were washed, eluted with loading buffer, subjected to
SDS-PAGE (NOVEX™) and visualized by autoradiogram or
phosphorimaging (Molecular Dynamics) on gels enhanced with Amplify
(Amersham). Immunodetection of the betaN-fragment was accomplished by
transblotting onto a PVDF membrane and detecting with anti-alpha-beta1-17
and chemiluminescent substrate (Amersham).
Results

HeLa cells transfected with APP or M2 in 4-well chamber slides were
fixed with acetone for 10 min and permeabilized in 0.2% Triton X-100 in
PBS for 6 min. For localizing M2, polyclonal goat anti-pro-M2pd antibodies
were purified on DEAE-sepharose 6B and affinity purified against
recombinant pro-M2pd immobilized on Affigel (BioRad). Purified anti-pro-
M2pd antibodies were conjugated to Alexa568 (Molecular Probes) according
to the manufacturer's protocol. Fixed cells were incubated overnight with a
1:100 dilution of antibody in PBS containing 0.1% BSA and washed 4 times
with PBS. For APP, two antibodies were used: antibody alpha-beta1-17 (described
above) and antibody alpha-beta17-42, which recognizes the first 26 residues
following the beta-secretase cleavage site (Chemicon). After 4 PBS washes,
the cells were incubated overnight with an anti-mouse FITC conjugate at a
dilution of 1:200. Cells were mounted in Prolong anti-fade reagent
(Molecular Probes) and visualized on a Leica TCS confocal laser scanning
microscope.

Example 7: Design and Synthesis of OM99-1 and OM99-2. 

Based on the results of specificity studies of memapsin 2, it was
predicted that good residues for positions P1 and P1' would be Leu and Ala.
It was subsequently determined from the specificity data that P1' preferred
small residues, such as Ala and Ser. However, the crystal structure
(determined below in Example 9) indicates that this site can accommodate a
lot of larger residues. It was demonstrated that P1' of memapsin 2 is the
position with the most stringent specificity requirement where residues of
small side chains, such as Ala, Ser, and Asp, are preferred. Ala was selected
for P1' mainly because its hydrophobicity over Ser and Asp is favored for the
penetration of the blood-brain barrier, a requirement for the design of a
memapsin 2 inhibitor drug for treating Alzheimer's disease. Therefore,
inhibitors were designed to place a transition-state analogue isostere between
Leu and Ala (shown as Leu*Ala, where * represents the transition-state
isostere, -CH(OH)-CH2-) and the subsites P4, P3, P2, P2', P3' and P4' are
filled with the beta-secretase site sequence of the Swedish mutant from the
beta-amyloid protein. The structures of inhibitors OM99-1 and OM99-2 are
shown below and in Figures 3A and 3B, respectively:

OM99-1: Val-Asn-Leu*Ala-Ala-Glu-Phe (SEQ ID NO:27)

OM99-2: Glu-Val-Asn-Leu*Ala-Ala-Glu-Phe (SEQ ID NO:28)

The Leu*Ala dipeptide isostere was synthesized as follows:

The Leu-Ala dipeptide isostere for the M2-inhibitor was prepared
from L-leucine. As shown in Scheme 1, L-leucine was protected as its BOC-
derivative 2 by treatment with BOC2O in the presence of 10% NaOH in
diethyl ether for 12 h. Boc-leucine 2 was then converted to Weinreb amide 3
by treatment with isobutyl chloroformate and N-methylpiperidine followed
by treatment of the resulting mixed anhydride with N,O-
dimethylhydroxylamine (Nahm and Weinreb, Tetrahedron Letters 1981, 22,
3815). Reduction of 3 with lithium aluminum hydride in diethyl ether
provided the aldehyde 4. Reaction of the aldehyde 4 with lithium propiolate
derived from the treatment of ethyl propiolate and lithium diisopropyl amide
afforded the acetylenic alcohol 5 as an inseparable mixture of diastereomers
(5.8:1) in 42% isolated yield (Fray, Kaye and Kleinman, J. Org. Chem. 1986,
51, 4828-33). Catalytic hydrogenation of 5 over Pd/BaSO4 followed by
acid-catalyzed lactonization of the resulting gamma-hydroxy ester with a
catalytic amount of acetic acid in toluene at reflux furnished the gamma-
lactones 6 and 7 in 73% yield. The isomers were separated by silica gel
chromatography by using 40% ethyl acetate in hexane as the eluent.
Introduction of the methyl group at C-2 was accomplished by stereoselective
alkylation of 7 with methyl iodide (Scheme 2). Thus, generation of the
dianion of lactone 7 with lithium hexamethyldisilazide (2.2 equivalents) in
tetrahydrofuran at -78°C (30 min) and alkylation with methyl iodide (M
equivalents) for 30 min at -78°C, followed by quenching with propionic
acid (5 equivalents), provided the desired alkylated lactone 8 (76% yield)
along with a small amount (less than 5%) of the corresponding epimer
(Ghosh and Fidanze, J. Org. Chem. 1998, 63, 6146-54). The epimeric
cis-lactone was removed by column chromatography over silica gel using a
mixture (3:1) of ethyl acetate and hexane as the solvent system. The
stereochemical assignment of alkylated lactone 8 was made based on
extensive 1H-NMR NOE experiments. Aqueous lithium hydroxide
promoted hydrolysis of the lactone 8 followed by protection of the gamma-
hydroxyl group with tert-butyldimethylsilyl chloride in the presence of
imidazole and dimethylaminopyridine in dimethylformamide afforded the
acid 9 in 90% yield after standard work-up and chromatography. Selective
removal of the BOC-group was effected by treatment with trifluoroacetic
acid in dichloromethane at 0°C for 1 h. The resulting amine salt was then
reacted with commercial (Aldrich, Milwaukee) Fmoc-succinimide derivative
in dioxane in the presence of aqueous NaHCO3 to provide the Fmoc-
protected L*A isostere 10 in 65% yield after chromatography. Protected
isostere 10 was utilized in the preparation of a random sequence inhibitor
library.

Experimental procedure 

N-(tert-Butoxycarbonyl)-L-leucine (2).

To the suspension of 10 g (76.2 mmol) of L-leucine in 140 mL of
diethyl ether was added 80 mL of 10% NaOH. After all solid dissolves, 20
mL (87.1 mmol) of BOC2O was added to the reaction mixture. The resulting
reaction mixture was stirred at 23°C for 12 h. After this period, the layers
were separated and the aqueous layer was acidified to pH 1 by careful
addition of 1 N aqueous HCl at 0 °C. The resulting mixture was extracted
with ethyl acetate (3 x 100 mL). The organic layers were combined and
washed with brine and dried over anhydrous Na2SO4. The solvent was
removed under reduced pressure to provide title product which was used
directly for next reaction without further purification (yield, 97 %). 1H NMR
(400 MHz, CDCl3) delta 4.89 (broad d, 1H, J = 8.3 Hz), 4.31 (m, 1H), 1.74-
1.49 (m, 3H), 1.44 (s, 9H), 0.95 (d, 6H, J = 6.5 Hz).
N-(tert-Butoxycarbonyl)-L-leucine-N'-methoxy-N'-methylamide (3).

To a stirred solution of N,O-dimethylhydroxylamine hydrochloride
(5.52 g, 56.6 mmol) in dry dichloromethane (25 mL) under N2 atmosphere
at 0°C, 1-methylpiperidine (6.9 mL, 56.6 mmol) was added dropwise. The
resulting mixture was stirred at 0°C for 30 min. In a separate flask, N-(tert-
butyloxycarbonyl)-L-leucine (2) (11.9 g, 51.4 mmol) was dissolved in a
mixture of THF (45 mL) and dichloromethane (180 mL) under N2
atmosphere. The resulting solution was cooled to -20°C. To this solution
was added 1-methylpiperidine (6.9 mL, 56.6 mmol) followed by isobutyl
chloroformate (7.3 mL, 56.6 mmol). The resulting mixture was stirred for 5
minutes at -20°C and the above solution of N,O-dimethylhydroxylamine was
added to it. The reaction mixture was kept at -20°C for 30 minutes and then
warmed to 23°C. The reaction was quenched with water and the layers were
separated. The aqueous layer was extracted with dichloromethane (3 x 100
mL). The combined organic layers were washed with 10% citric acid,
saturated sodium bicarbonate, and brine. The organic layer was dried over
anhydrous Na2SO4 and concentrated under reduced pressure. The residue
was purified by flash silica gel chromatography (25% ethyl acetate/hexane)
to yield the title compound 3 (13.8 g, 97%) as a pale yellow oil. 1H NMR
(400 MHz, CDCl3) delta 5.06 (broad d, 1H, J = 9.1 Hz), 4.70 (m, 1H), 3.82 (s,
3H), 3.13 (s, 3H), 1.70 (m, 1H), 1.46-1.36 (m, 2H), 1.41 (s, 9H), 0.93 (dd,
6H, J = 6.5, 14.2 Hz).
N-(tert-Butoxycarbonyl)-L-leucinal (4).

To a stirred suspension of lithium aluminum hydride (770 mg, 20.3
mmol) in dry diethyl ether (60 mL) at -40 °C under N2 atmosphere, was
added N-tert-butyloxycarbonyl-L-leucine-N'-methoxy-N'-methylamide
(5.05 g, 18.4 mmol) in diethyl ether (20 mL). The resulting reaction mixture
was stirred for 30 min. After this period, the reaction was quenched with
10% NaHSO4 solution (30 mL). The resulting reaction mixture was then
warmed to 23°C and stirred at that temperature for 30 min. The resulting
solution was filtered and the filter cake was washed by two portions of
diethyl ether. The combined organic layers were washed with saturated
sodium bicarbonate, brine and dried over anhydrous MgSO4. Evaporation of
the solvent under reduced pressure afforded the title aldehyde 4 (3.41 g) as a
pale yellow oil. The resulting aldehyde was used immediately without
further purification. 1H NMR (400 MHz, CDCl3) delta 9.5 (s, 1H), 4.9 (s, 1H),
4.2 (broad m, 1H), 1.8-1.6 (m, 2H), 1.44 (s, 9H), 1.49-1.39 (m, 1H), 0.96
(dd, 6H, J = 2.7, 6.5 Hz).

Ethyl (4S,5S)- and (4R,5S)-5-[(tert-Butoxycarbonyl)amino]-4-hydroxy-7-
methyloct-2-ynoate (5).

To a stirred solution of diisopropylamine (1.1 mL, 7.9 mmol) in dry
THF (60 mL) at 0°C under N2 atmosphere, was added n-BuLi (1.6 M in
hexane, 4.95 mL, 7.9 mmol) dropwise. The resulting solution was stirred at
0°C for 5 min and then warmed to 23°C and stirred for 15 min. The mixture
was cooled to -78°C and ethyl propiolate (801 microL) in THF (2 mL) was added
dropwise over a period of 5 min. The mixture was stirred for 30 min, after
which N-Boc-L-leucinal 4 (1.55 g, 7.2 mmol) in 8 mL of dry THF was
added. The resulting mixture was stirred at -78°C for 1 h. After this period,
the reaction was quenched with acetic acid (5 mL) in THF (20 mL). The
reaction mixture was warmed up to 23°C and brine solution was added. The
layers were separated and the organic layer was washed with saturated
sodium bicarbonate and dried over Na2SO4. Evaporation of the solvent under
reduced pressure provided a residue which was purified by flash silica gel
chromatography (15 % ethyl acetate / hexane) to afford a mixture (3:1) of
acetylenic alcohols 5 (0.96 g, 42 %). 1H NMR (300 MHz, CDCl3) delta 4.64 (d,
1H, J = 9.0 Hz), 4.44 (broad s, 1H), 4.18 (m, 2H), 3.76 (m, 1H), 1.63 (m,
1H), 1.43-1.31 (m, 2H), 1.39 (s, 9H), 1.29-1.18 (m, 3H), 0.89 (m, 6H).

(5S,1'S)-5-[1'-[(tert-Butoxycarbonyl)amino]-3'-methylbutyl]
-dihydrofuran-2(3H)-one (7).

To a stirred solution of the above mixture of acetylenic alcohols (1.73
g, 5.5 mmol) in ethyl acetate (20 mL) was added 5% Pd/BaSO4 (1 g). The
resulting mixture was hydrogenated at 50 psi for 1.5 h. After this period, the
catalyst was filtered off through a plug of Celite and the filtrate was
concentrated under reduced pressure. The residue was dissolved in toluene
(20 mL) and acetic acid (100 microL). The reaction mixture was refluxed for 6 h.
After this period, the reaction was cooled to 23°C and the solvent was
evaporated to give a residue which was purified by flash silica gel
chromatography (40% diethyl ether / hexane) to yield the (5S,1'S)-gamma-
lactone 7 (0.94 g, 62.8 %) and the (5R,1'S)-gamma-lactone 6 (0.16 g, 10.7 %).
Lactone 7: 1H NMR (400 MHz, CDCl3) delta 4.50-4.44 (m, 2H), 3.84-3.82 (m,
1H), 2.50 (t, 2H, J = 7.8 Hz), 2.22-2.10 (m, 2H), 1.64-1.31 (m, 3H), 1.41 (s,
9H), 0.91 (dd, 6H, J = 2.2, 6.7 Hz); 13C NMR (75 MHz, CDCl3) delta 177.2,
156.0, 82.5, 79.8, 51.0, 42.2, 28.6, 28.2, 24.7, 24.2, 23.0, 21.9.
(3R,5S,1'S)-5-[1'-[(tert-Butoxycarbonyl)amino]-3'-methylbutyl]-3-methyl-
dihydrofuran-2(3H)-one (8).

To a stirred solution of the lactone 7 (451.8 mg, 1.67 mmol) in dry
THF (8 mL) at -78°C under N2 atmosphere, was added lithium
hexamethyldisilazane (3.67 mL, 1.0 M in THF) over a period of 3 min. The
resulting mixture was stirred at -78°C for 30 min to generate the lithium
enolate. After this period, MeI (228 microL) was added dropwise and the
resulting mixture was stirred at -78°C for 20 min. The reaction was
quenched with saturated aqueous NH4Cl solution and was allowed to warm
to 23°C. The reaction mixture was concentrated under reduced pressure and
the residue was extracted with ethyl acetate (3 x 100 mL). The combined
organic layers were washed with brine and dried over anhydrous Na2SO4.
Evaporation of the solvent afforded a residue which was purified by silica
gel chromatography (15 % ethyl acetate / hexane) to furnish the alkylated
lactone 8 (0.36 g, 76 %) as an amorphous solid. 1H NMR (300 MHz,
CDCl3) delta 4.43 (broad t, 1H, J = 6.3 Hz), 4.33 (d, 1H, J = V.6 Hz), 3.78 (m,
1H), 2.62 (m, 1H), 2.35 (m, 1H), 1.86 (m, 1H), 1.63-1.24 (m, 3H), 1.37 (s,
9H), 1.21 (d, 3H, J = 7.5 Hz), 0.87 (dd, 6H, J = 2.6, 6.7 Hz); 13C NMR (75
MHz, CDCl3) delta 180.4, 156.0, 80.3, 79.8, 51.6, 41.9, 34.3, 32.5, 28.3, 24.7,
23.0, 21.8, 16.6.
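
The reagent quantities in the alkylation of lactone 7 can be cross-checked against the stated stoichiometry with a little arithmetic, sketched here in Python. The density (2.28 g/mL) and molecular weight (141.94 g/mol) of methyl iodide are literature values supplied as assumptions, not figures from this procedure, and the helper names are illustrative.

```python
def mmol_from_solution(volume_ml: float, molarity: float) -> float:
    """Millimoles delivered by a volume (mL) of a solution of given molarity (mol/L)."""
    return volume_ml * molarity


def mmol_from_neat_liquid(volume_ul: float, density_g_per_ml: float,
                          mol_weight: float) -> float:
    """Millimoles in a volume (microL) of neat liquid."""
    grams = (volume_ul / 1000.0) * density_g_per_ml
    return grams / mol_weight * 1000.0


def equivalents(reagent_mmol: float, substrate_mmol: float) -> float:
    """Molar equivalents of reagent relative to the limiting substrate."""
    return reagent_mmol / substrate_mmol


if __name__ == "__main__":
    lactone_mmol = 1.67  # lactone 7, from the procedure
    base_mmol = mmol_from_solution(3.67, 1.0)  # 3.67 mL of 1.0 M base in THF
    # Assumed physical constants for methyl iodide (not from the procedure).
    mei_mmol = mmol_from_neat_liquid(228.0, 2.28, 141.94)
    print("base equivalents:", round(equivalents(base_mmol, lactone_mmol), 2))
    print("MeI equivalents: ", round(equivalents(mei_mmol, lactone_mmol), 2))
```

The 3.67 mL of 1.0 M base corresponds to about 2.2 equivalents relative to 1.67 mmol of lactone 7, in agreement with the 2.2 equivalents quoted in the synthesis overview; with the assumed constants, 228 microL of methyl iodide also works out to roughly 2.2 equivalents.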

(2R,4S,5S)-5-[(tert-Butoxycarbonyl)amino]-4-[(tert-butyldimethyl
-silyl)oxy]-2,7-dimethyloctanoic acid (9).

To a stirred solution of lactone 8 (0.33 g, 1.17 mmol) in THF (2 mL)
was added 1 N aqueous LiOH solution (5.8 mL). The resulting mixture was
stirred at 23°C for 10 h. After this period, the reaction mixture was
concentrated under reduced pressure and the remaining aqueous residue was
cooled to 0°C and acidified with 25% citric acid solution to pH 4. The
resulting acidic solution was extracted with ethyl acetate (3 x 50 mL). The
combined organic layers were washed with brine, dried over Na2SO4 and
concentrated to yield the corresponding hydroxy acid (330 mg) as a white
foam. This hydroxy acid was used directly for the next reaction without
further purification.

To the above hydroxy acid (330 mg, 1.1 mmol) in anhydrous DMF
was added imidazole (1.59 g, 23.34 mmol) and tert-
butyldimethylchlorosilane (1.76 g, 11.67 mmol). The resulting mixture was
stirred at 23°C for 24 h. After this period, MeOH (4 mL) was added and the
mixture was stirred for 1 h. The mixture was diluted with 25% citric acid (20
mL) and was extracted with ethyl acetate (3 x 20 mL). The combined
extracts were washed with water, brine and dried over anhydrous Na2SO4.
Evaporation of the solvent gave a viscous oil which was purified by flash
chromatography over silica gel (35% ethyl acetate / hexane) to afford the
silyl protected acid 9 (0.44 g, 90 %). IR (neat) 3300-3000 (broad), 2955,
2932, 2859, 1711 cm-1; 1H NMR (400 MHz, DMSO-d6, 343 K) delta 6.20
(broad s, 1H), 3.68 (m, 1H), 3.51 (broad s, 1H), 2.49-2.42 (m, 1H), 1.83 (t,
1H, J = 10.1 Hz), 1.56 (m, 1H), 1.37 (s, 9H), 1.28-1.12 (m, 3H), 1.08 (d, 3H,
J = 7.1 Hz), 0.87 (d, 3H, J = 6.1 Hz), 0.86 (s, 9H), 0.82 (d, 3H, J = 6.5 Hz),
0.084 (s, 3H), 0.052 (s, 3H).

(2R,4S,5S)-5-[(Fluorenylmethyloxycarbonyl)amino]-4-[(tert-butyldimethyl
-silyl)oxy]-2,7-dimethyloctanoic acid (10).

To a stirred solution of the acid 9 (0.17 g, 0.41 mmol) in
dichloromethane (2 mL) at 0°C was added trifluoroacetic acid (500 microL). The
resulting mixture was stirred at 0°C for 1 h and an additional portion (500
microL) of trifluoroacetic acid was added to the reaction mixture. The mixture
was stirred for an additional 30 min and the progress of the reaction was
monitored by TLC. After this period, the solvents were carefully removed
under reduced pressure at a bath temperature not exceeding 5°C. The residue
was dissolved in dioxane (3 mL) and NaHCO3 (300 mg) in 5 mL of H2O. To
this solution was added Fmoc-succinimide (166.5 mg, 0.49 mmol) in 5 mL
of dioxane. The resulting mixture was stirred at 23°C for 8 h. The mixture
was then diluted with H2O (5 mL) and acidified with 25% aqueous citric acid
to pH 4. The acidic solution was extracted with ethyl acetate (3 x 50 mL).
The combined extracts were washed with brine, dried over Na2SO4 and
concentrated under reduced pressure to give a viscous oil residue.
Purification of the residue by flash chromatography over silica gel afforded
the Fmoc-protected acid 10 (137 mg, 61%) as a white foam. 1H NMR (400
MHz, DMSO-d6, 343 K) delta 7.84 (d, 2H, J = 7.4 Hz), 7.66 (d, 2H, J = 8 Hz),
7.39 (t, 2H, J = 7.4 Hz), 7.29 (m, 2H), 6.8 (s, 1H), 4.29-4.19 (m, 3H), 3.74-
3.59 (m, 2H), 2.49 (m, 1H), 1.88 (m, 1H), 1.58 (m, 1H), 1.31-1.17 (m, 3H),
1.10 (d, 3H, J = 7.1 Hz), 0.88 (s, 9H), 0.82 (d, 6H, J = 6.2 Hz), 0.089 (s,
3H), 0.057 (s, 3H).

The synthesis of OM99-1 and OM99-2 was accomplished using a
solid state peptide synthesis procedure in which Leu*Ala was incorporated in
the fourth step. The synthesized inhibitors were purified by reverse phase
HPLC and their structures confirmed by mass spectrometry.
Example 8. Inhibition of Memapsin 2 by OM99-1 and OM99-2.
Enzyme activity was measured as described above, but with the
addition of either OM99-1 or OM99-2. OM99-1 inhibited recombinant
memapsin 2 as shown in Figure 5A. The Ki calculated is 3 x 10⁻⁸ M. The
substrate used was a synthetic fluorogenic peptide substrate. The inhibition
of OM99-2 on recombinant memapsin 2 was measured using the same
fluorogenic substrate. The Ki value was determined to be 9.58 x 10⁻⁹ M, as
shown in Figure 5B.
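
To give a feel for what an inhibition constant in the nanomolar range means, the following minimal sketch evaluates the classical competitive-inhibition rate law. It is an illustration only: the competitive model, the substrate concentration and the Km are assumptions that do not come from the text, and the Ki is the rounded figure of about 10 nM quoted for OM99-2 in Example 9. Tight-binding inhibitors are often analyzed with more elaborate equations than this one.

```python
def fractional_activity(inhibitor_m, ki_m, substrate_m, km_m):
    """Return v_i / v_0 for classical competitive inhibition (all inputs in M)."""
    v0 = substrate_m / (km_m + substrate_m)
    vi = substrate_m / (km_m * (1.0 + inhibitor_m / ki_m) + substrate_m)
    return vi / v0


KI_OM99_2 = 10e-9   # M; "about 10 nM", the rounded value quoted in Example 9
SUBSTRATE = 1e-6    # M; hypothetical assay substrate concentration
KM = 1e-4           # M; hypothetical Km of the fluorogenic substrate

if __name__ == "__main__":
    for inhibitor in (1e-9, 1e-8, 1e-7):
        frac = fractional_activity(inhibitor, KI_OM99_2, SUBSTRATE, KM)
        print(f"[I] = {inhibitor:.0e} M -> {100 * frac:.0f}% of uninhibited rate")
```

Under these assumed conditions, with substrate far below Km, the rate falls to roughly half when the inhibitor concentration equals Ki.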

These results demonstrate that the predicted subsite specificity is
accurate and that inhibitors can be designed based on the predicted
specificity.

The residues in P1 and P1' are very important since the M2 inhibitor
must penetrate the blood-brain barrier (BBB). The choice of Ala in P1'
facilitates the penetration of the BBB. Analogues of the Ala side chain will
also work; for example, in addition to the methyl side chain of Ala,
substituted methyl groups and groups of about the same size, like methyl or
ethyl groups, can be substituted for the Ala side chain. Leu at P1 can also
be substituted by groups of similar sizes or with substitutions on the Leu
side chain. For penetrating the BBB, it is desirable to make the inhibitors
smaller. One can therefore use OM99-1 as a starting point and discard the
outside subsites P4, P3, P3' and P4'. The retained structure
Asn-Leu*Ala-Ala (SEQ ID NO:29) is then further evolved with substitutions
for a tight-binding M2 inhibitor which can also penetrate the BBB.
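
The size argument can be made concrete with a rough mass estimate. The sketch below sums standard average residue masses for the unmodified octapeptide Glu-Val-Asn-Leu-Ala-Ala-Glu-Phe and for the retained core Asn-Leu-Ala-Ala, and compares them with the limit of about 800 daltons mentioned in Example 10. It treats both as ordinary peptides, which is an assumption: the hydroxyethylene isostere at Leu*Ala is ignored, although it changes the mass by only about 1 Da.

```python
# Standard average residue masses in daltons (residue = amino acid minus water).
RESIDUE_MASS = {
    "A": 71.0788, "E": 129.1155, "F": 147.1766,
    "L": 113.1594, "N": 114.1038, "V": 99.1326,
}
WATER = 18.0153
SIZE_LIMIT_DA = 800.0  # approximate limit quoted in Example 10


def peptide_mass(sequence):
    """Average mass of a linear, unmodified peptide given in one-letter code."""
    return sum(RESIDUE_MASS[residue] for residue in sequence) + WATER


if __name__ == "__main__":
    for label, sequence in (("octapeptide", "EVNLAAEF"), ("retained core", "NLAA")):
        mass = peptide_mass(sequence)
        verdict = "under" if mass < SIZE_LIMIT_DA else "over"
        print(f"{label} {sequence}: {mass:.1f} Da ({verdict} {SIZE_LIMIT_DA:.0f} Da)")
```

On this estimate the full octapeptide is somewhat above the limit while the four-residue core is well below it, which is consistent with the suggestion to trim the outer subsites.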

Example 9. Crystallization and X-ray diffraction study of the protease
domain of human memapsin 2 complexed to a specifically designed
inhibitor, OM99-2.
The crystallization condition and preliminary x-ray diffraction data
on recombinant human memapsin 2 complexed to OM99-2 were determined.
Production of Recombinant Memapsin 2 

About 50 mg of recombinant memapsin 2 was purified as described
in Example 3. For optimal crystal growth, memapsin 2 must be highly
purified. Memapsin 2 was over-expressed from vector pET11a-M2pd. This
memapsin 2 is the zymogen domain which includes the pro and catalytic
domains to the end of the C-terminal extension but does not include the
transmembrane and the intracellular domains. The vector was transfected
into E. coli BL21 (DE3) and plated onto ZB agar containing 50 mg/liter
ampicillin. A single colony was picked to inoculate 100 ml of liquid ZB
containing 5 mg ampicillin and cultured at 30°C for 18 hours, with shaking
at 220 RPM. Aliquots of approximately 15 ml of the overnight culture were
used to inoculate each 1 liter of LB containing 50 mg of ampicillin. Cultures
were grown at 37°C with shaking at 180 RPM until an optical density at
600 nm near 0.8 was attained. At that time, expression was induced by
addition of 119 mg of IPTG to each liter of culture. Incubation was
continued for 3 additional hours post-induction.
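
The IPTG dose is given as a mass per liter; converting it to a molar concentration makes it easier to compare with other protocols. The short sketch below does that conversion, using the molar mass of IPTG (238.30 g/mol), which is a standard value and not stated in the text.

```python
IPTG_MOLAR_MASS = 238.30  # g/mol, standard value for IPTG (not given in the text)


def mg_per_liter_to_millimolar(mg_per_liter, molar_mass_g_per_mol):
    """Convert a mass concentration in mg/L to mM (mg/L divided by g/mol gives mmol/L)."""
    return mg_per_liter / molar_mass_g_per_mol


if __name__ == "__main__":
    iptg_mm = mg_per_liter_to_millimolar(119.0, IPTG_MOLAR_MASS)
    print(f"119 mg IPTG per liter is about {iptg_mm:.2f} mM")
```

The result is close to 0.5 mM.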

Bacteria were harvested, suspended in 50 mM Tris, 150 mM NaCl,
pH 7.5 (TN buffer), and lysed by incubation with 6 mg lysozyme for 30
minutes, followed by freezing for 18 hours at -20°C. Lysate was thawed and
made to 1 mM MgCl2, then 1000 Kunitz units of DNAse were added with
stirring, and incubated for 30 min. Volume was expanded to 500 ml with
TN containing 0.1% Triton X-100 (TNT buffer) and lysate stirred for 30
minutes. Insoluble inclusion bodies containing greater than 90% memapsin
2 protein were pelleted by centrifugation, and washed by resuspension in
TNT with stirring for 1-2 hours. Following three additional TNT washes, the
memapsin 2 inclusion bodies were dissolved in 40 ml of 8 M urea, 1 mM
EDTA, 1 mM glycine, 100 mM Tris base, 100 mM beta-mercaptoethanol (8
M urea buffer). Optical density at 280 nm was measured, and volume
expanded with 8 M urea buffer to achieve final O.D. near 0.5, with addition
of sufficient quantity of beta-mercaptoethanol to attain 10 mM total, and 10
mM DTT, 1 mM reduced glutathione, 0.1 mM oxidized glutathione. The pH
of the solution was adjusted to 10.0 or greater, and the solution was divided
into four aliquots of 200 ml each. Each 200 ml was rapidly diluted into 4
liters of 20 mM Tris base, with rapid stirring. The pH was adjusted
immediately to 9.0 with 1 M HCl, and the solution was stored at 4°C
overnight. The following morning the diluted memapsin 2 solution was
maintained at room temperature for 4-6 hours, followed by adjusting the pH
to 8.5 and returning the flasks to the 4°C room. The same procedure was
followed the next day with adjustment of pH to 8.0.
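
The rapid-dilution step fixes both the residual urea concentration and the total working volume. As a small worked check, the sketch below computes them from the volumes given above, assuming volumes are simply additive and ignoring the small amounts of HCl added for pH adjustment.

```python
def diluted_concentration(stock_conc, sample_volume_l, diluent_volume_l):
    """Concentration after mixing a sample into diluent, assuming additive volumes."""
    return stock_conc * sample_volume_l / (sample_volume_l + diluent_volume_l)


UREA_STOCK_M = 8.0      # M urea in the denaturing buffer
ALIQUOT_L = 0.2         # each 200 ml aliquot
DILUENT_L = 4.0         # 4 liters of 20 mM Tris base per aliquot
N_ALIQUOTS = 4

UREA_AFTER_DILUTION_M = diluted_concentration(UREA_STOCK_M, ALIQUOT_L, DILUENT_L)
TOTAL_VOLUME_L = N_ALIQUOTS * (ALIQUOT_L + DILUENT_L)

if __name__ == "__main__":
    print(f"dilution factor: {(ALIQUOT_L + DILUENT_L) / ALIQUOT_L:.0f}-fold")
    print(f"urea after dilution: {UREA_AFTER_DILUTION_M:.2f} M")
    print(f"total refolding volume: {TOTAL_VOLUME_L:.1f} liters")
```

This gives a 21-fold dilution to about 0.38 M urea and a total of 16.8 liters, in line with the roughly 0.4 M urea buffers and the volume of approximately 16 liters mentioned in the purification that follows.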

This memapsin 2 solution was allowed to stand at 4°C for 2-3 weeks.
The total volume of approximately 16 liters was concentrated to 40 ml using
ultra-filtration (Millipore) and stir-cells (Amicon), and centrifuged at
140,000 x g for 30 minutes in a rotor pre-equilibrated to 4°C. The recovered
supernatant was applied to a 2.5 x 100 cm column of S-300 equilibrated in
0.4 M urea, 20 mM Tris-HCl, pH 8.0, and eluted with the same buffer at 30
ml/hour. The active fraction of memapsin 2 was pooled and further purified
in a FPLC using a 1 ml Resource-Q (Pharmacia) column. Sample was
filtered, and applied to the Resource-Q column equilibrated in 0.4 M urea, 50
mM Tris-HCl, pH 8.0. Sample was eluted with a gradient of 0 - 1 M NaCl in
the same buffer, over 30 ml at 2 ml/min. The eluents containing memapsin 2
appeared near 0.4 M NaCl and were pooled for the crystallization procedure
at a concentration near 5 mg/ml.
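
For orientation, the position of the memapsin 2 peak within the salt gradient can be estimated from the gradient parameters. The sketch below assumes an ideal linear gradient and ignores column and system dead volume, so the numbers are nominal rather than what a chromatogram would show exactly.

```python
def gradient_position(target_conc, start_conc, end_conc, gradient_volume_ml, flow_ml_per_min):
    """Return (volume in ml, time in min) at which a linear gradient reaches target_conc."""
    fraction = (target_conc - start_conc) / (end_conc - start_conc)
    volume_ml = fraction * gradient_volume_ml
    return volume_ml, volume_ml / flow_ml_per_min


if __name__ == "__main__":
    # 0 - 1 M NaCl over 30 ml at 2 ml/min; elution observed near 0.4 M NaCl
    volume, minutes = gradient_position(0.4, 0.0, 1.0, 30.0, 2.0)
    print(f"0.4 M NaCl is reached about {volume:.0f} ml ({minutes:.0f} min) into the gradient")
```

Under these assumptions the peak is expected roughly 12 ml, or 6 minutes, after the gradient starts.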

The amino-terminal sequence of the protein before crystallization
showed two sequences starting respectively at residues 28p and 30p.
Apparently, the pro peptide of recombinant pro-memapsin 2 had been
cleaved during the preparation by a yet unidentified proteolytic activity.

The activation of the folded pro-enzyme to mature enzyme,
memapsin 2, was carried out as described above, i.e., incubation in 0.1 M
sodium acetate pH 4.0 for 16 hours at 22°C. Activated enzyme was further
purified using anion-exchange column chromatography on a Resource-Q anion
exchange column. The purity of the enzyme was demonstrated by SDS-gel
electrophoresis. At each step of the purification, the specific activity of the
enzyme was assayed as described above to ensure the activity of the enzyme.
Preliminary Crystallization with OM99-2

Crystal trials were performed on purified memapsin 2 in complex
with a substrate based transition-state inhibitor OM99-2 with a Ki = 10 nM.
OM99-2 is equivalent to eight amino-acid residues (including subsites S4,
S3, S2, S1, S1', S2', S3' and S4' in a sequence EVNLAAEF) with the
substitution of the peptide bond between the S1 and S1' (L-A) by a
transition-state isostere hydroxyethylene. Purified M2 was concentrated and
mixed with a 10-fold molar excess of inhibitor. The mixture was
incubated at room temperature for 2-3 hours to optimize the inhibitor
binding. The crystallization trial was conducted at 20°C using the hanging
drop vapor diffusion procedure. A systematic search with various
crystallization conditions was conducted to find the optimum crystallization
conditions for the memapsin 2/OM99-2 inhibitor complex. For the first step, a
coarse screen aimed at covering a wide range of potential conditions was
carried out using the Sparse Matrix Crystallization Screen Kits purchased
from Hampton Research. Protein concentration and temperature were used
as additional variables. Conditions giving promising (micro) crystals were
subsequently used as starting points for optimization, using fine grids of pH,
precipitant concentration, etc.

Crystals of memapsin-inhibitor complex were obtained at 30% PEG
8000, 0.1 M sodium cacodylate, pH 6.4. SDS gel electrophoresis of a dissolved
crystal verified the content of the crystal to be memapsin 2. Several
single crystals (with sizes of about 0.3 mm x 0.2 mm x 0.1 mm) were
carefully removed from the cluster for data collection on a Raxis IV image
plate. These results showed that the crystals diffract to 2.6 Å. A typical
protein diffraction pattern is shown in Figure 6. An X-ray image
visualization and integration software, Denzo, was used to visualize and
index the diffraction data. Denzo identified that the primitive orthorhombic
lattice has the highest symmetry with a significantly low distortion index.
The unit cell parameters were determined as: a=89.1 Å, b=96.6 Å, c=134.1
Å, α=β=γ=90°. There are two memapsin 2/OM99-2 complexes per
crystallographic asymmetric unit; the Vm of the crystal is 2.9 Å³/Da.
Diffraction extinctions suggested that the space group is P212121.
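
The quoted Vm (the Matthews coefficient) follows from the unit cell volume, the number of molecules in the cell and the molecular mass. The sketch below reproduces the calculation for the orthorhombic cell. The mass of about 50 kDa per complex is an assumption made for illustration, since the text does not state the value that was used; the factor 1.23 in the solvent estimate is the conventional constant for proteins.

```python
def matthews_coefficient(cell_volume_a3, asu_per_cell, copies_per_asu, mass_da):
    """Vm in cubic angstroms per dalton."""
    return cell_volume_a3 / (asu_per_cell * copies_per_asu * mass_da)


def solvent_fraction(vm):
    """Approximate solvent fraction from Vm using the conventional constant 1.23."""
    return 1.0 - 1.23 / vm


A, B, C = 89.1, 96.6, 134.1      # angstroms; all cell angles are 90 degrees
CELL_VOLUME = A * B * C          # orthorhombic cell volume
ASU_PER_CELL = 4                 # P212121 has four asymmetric units per cell
COPIES_PER_ASU = 2               # two complexes per asymmetric unit (from the text)
ASSUMED_MASS_DA = 50_000.0       # assumption: approximate mass of one complex

VM = matthews_coefficient(CELL_VOLUME, ASU_PER_CELL, COPIES_PER_ASU, ASSUMED_MASS_DA)

if __name__ == "__main__":
    print(f"cell volume: {CELL_VOLUME:,.0f} cubic angstroms")
    print(f"Vm: {VM:.2f} cubic angstroms per dalton")
    print(f"estimated solvent fraction: {solvent_fraction(VM):.2f}")
```

With the assumed mass, Vm comes out near 2.9 Å³/Da, matching the value reported for this crystal form.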

With diffraction of the current crystal to 2.6 Å, the crystal structure
obtained from these data has the potential to reach atomic solution, i.e., the
three-dimensional positions of atoms and chemical bonds in the inhibitor
and in memapsin 2 can be deduced. Since the memapsin 2 sequence is
homologous with other mammalian aspartic proteases, e.g., pepsin or
cathepsin D, it is predicted that the three dimensional structure of
memapsin 2 will be similar (but not identical) to their structures. Therefore,
in the determination of the x-ray structure from the diffraction data obtained
from the current crystal, it is likely the solution of the phase can be obtained
from the molecular replacement method using the known crystal structure
of aspartic proteases as the search model.
Further Crystallization Studies 

Concentrated memapsin 2 was mixed with a 10-fold molar excess
of the inhibitor. The mixture was incubated at room temperature for 2-3
hours to optimize inhibitor binding, and then clarified with a 0.2 micron
filter using centrifugation. Crystals of memapsin 2-inhibitor complex were
grown at 20°C by the hanging drop vapor diffusion method using equal
volumes of enzyme-inhibitor and well solution. Crystals of quality suitable
for diffraction studies were obtained in two weeks in 0.1 M sodium
cacodylate, pH 7.4, 0.2 M (NH4)2SO4, and 22.5% PEG 8000. The typical
size of the crystals was about 0.4 x 0.4 x 0.2 mm³.

Diffraction data were measured on a Raxis-IV image plate with a
Rigaku X-ray generator, processed with the HKL program package [Z.
Otwinowski, W. Minor, Methods Enzymol. 276, 307 (1997)]. A single
crystal of approximately 0.4 x 0.4 x 0.2 mm³ in size was treated with a
cryo-protection solution of 25% PEG 8000, 20% glycerol, 0.1 M sodium
cacodylate pH 6.6, and 0.2 M (NH4)2SO4, and then flash-cooled with liquid
nitrogen to about -180°C for data collection. Diffraction was observed to
at least 1.9 Å. The crystal form belongs to space group P21 with two
memapsin 2/OM99-2 complexes per crystallographic asymmetric unit and
56% solvent content.
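
As a consistency check on the monoclinic form, the reported solvent content can be turned back into an implied mass per complex. The sketch below is a rough estimate: it uses the P21 cell dimensions listed in Table 2 (reading the c axis as 109.2 Å), the conventional constant 1.23 relating Vm to solvent fraction, and two asymmetric units per P21 cell. The implied mass is an output of this estimate, not a figure from the text.

```python
import math


def monoclinic_volume(a, b, c, beta_deg):
    """Unit cell volume for a monoclinic cell (alpha = gamma = 90 degrees)."""
    return a * b * c * math.sin(math.radians(beta_deg))


def vm_from_solvent_fraction(fraction):
    """Invert the conventional relation: solvent fraction = 1 - 1.23 / Vm."""
    return 1.23 / (1.0 - fraction)


VOLUME = monoclinic_volume(53.7, 85.9, 109.2, 101.4)   # cell from Table 2
VM = vm_from_solvent_fraction(0.56)                    # 56% solvent content
ASU_PER_CELL = 2                                       # space group P21
COPIES_PER_ASU = 2                                     # two complexes per asymmetric unit
IMPLIED_MASS_DA = VOLUME / (ASU_PER_CELL * COPIES_PER_ASU * VM)

if __name__ == "__main__":
    print(f"cell volume: {VOLUME:,.0f} cubic angstroms")
    print(f"Vm implied by 56% solvent: {VM:.2f} cubic angstroms per dalton")
    print(f"implied mass per complex: {IMPLIED_MASS_DA / 1000:.0f} kDa")
```

The implied mass, in the mid-40 kDa range, is plausible for a protease domain of roughly 400 residues plus inhibitor.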

Molecular replacement was performed with data in the range of
15.0-3.5 Å using program AMoRe, CCP4 package [Navaza, J., Acta
Crystallog. Sect. A, 50, 157 (1994)]. Pepsin, a human aspartic protease
with 22% sequence identity, was used as the search model (PDB id 1psn).
Rotation and translation search, followed by rigid body refinement,
identified a top solution and positioned both molecules in the asymmetric
unit. The initial solution had a correlation coefficient of 22% and an
R-factor of 0.51. The refinement was carried out using the program CNS
[Brunger et al., Acta Crystallogr. Sect. D, 54, 905 (1998)]. 10% of
reflections were randomly selected prior to refinement for Rfree monitoring
[Brunger, A.T., X-PLOR Version 3.1: A system for X-ray Crystallography
and NMR, Yale University Press, New Haven, CT (1992)]. The molecular
graphics program O [Jones, T.A., et al., Improved methods for building
protein models in electron density maps and location of errors in these
models, Acta Crystallogr. Sect. A 47, 110 (1991)] was used for map display
and model building. From the initial pepsin model, corresponding amino
acid residues were changed to those of memapsin 2 according to sequence
alignment. The side chain conformations were decided by the initial
electron density map and a rotamer library. This model was refined using
the molecular dynamics and energy minimization functions of CNS [Brunger,
A.T., et al., Acta Crystallogr. Sect. D, 54, 905 (1998)]. The first cycle of
refinement dropped the Rworking to 41% and the Rfree to 45%. At this stage,
electron densities in the omit map clearly showed the inhibitor configuration
in the active site cleft. Structural features unique to memapsin 2 in chain
tracing, secondary structure, insertions, deletions and extensions (as
compared to the search model) were identified and constructed in subsequent
iterations of crystallographic refinement and map fitting. The inhibitor was
built into the corresponding electron density.
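
The R-factors quoted here all use the same formula, the sum of the absolute differences between observed and calculated structure-factor amplitudes divided by the sum of the observed amplitudes, evaluated either on the working reflections or on the 10% held out for Rfree. The sketch below illustrates both ideas in plain Python. It is not how CNS implements them, the function names are hypothetical, and the amplitudes in the demonstration are made up.

```python
import random


def r_factor(f_obs, f_calc):
    """R = sum(| |Fo| - |Fc| |) / sum(|Fo|) over paired reflections."""
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    return numerator / denominator


def split_free_set(n_reflections, free_fraction=0.10, seed=0):
    """Randomly split reflection indices into (working, free) lists."""
    rng = random.Random(seed)
    indices = list(range(n_reflections))
    rng.shuffle(indices)
    n_free = round(n_reflections * free_fraction)
    free = set(indices[:n_free])
    working = [i for i in range(n_reflections) if i not in free]
    return working, sorted(free)


if __name__ == "__main__":
    # Made-up amplitudes, for illustration only.
    f_obs = [100.0, 50.0, 25.0]
    f_calc = [90.0, 55.0, 25.0]
    print(f"R = {r_factor(f_obs, f_calc):.3f}")
    working, free = split_free_set(69_056)  # unique reflections listed in Table 2
    print(f"{len(working)} working and {len(free)} free reflections")
```

Because the free reflections never enter refinement, Rfree is normally somewhat higher than Rworking, as in the 41% and 45% reported after the first cycle.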

About 440 solvent molecules were then gradually added to the
structure as identified in the |Fo|-|Fc| map contoured at the 3 sigma level.
Non-crystallographic symmetry restriction and averaging were used in early
stages of refinement and model building. Bulk solvent and anisotropic
over-all B factor corrections were applied through the refinement. The final
structure was validated by the program PROCHECK [Laskowski, R.A. et al.,
J. Appl. Crystallog. 26, 283 (1993)], which showed that 95% of the residues
are located in the most favored region of the Ramachandran plot. All the
main chain and side chain parameters are within or better than the standard
criteria. The final Rworking and Rfree are 18% and 22% respectively.
Refinement statistics are listed in Table 2.

Table 2. Data Collection and Refinement Statistics

A. Data Statistics
   Space group                              P21
   Unit cell (a, b, and c in Å)             53.7, 85.9, 109.2
             (α, β, and γ in degrees)       90.0, 101.4, 90.0
   Resolution (Å)                           25.0-1.9
   Number of observed reflections           144,164
   Number of unique reflections             69,056
   Rmerge                                   0.061 (0.25)
   Data completeness (%) (25.0-1.9 Å)       90.0 (68.5)
   <I/σ(I)>                                 13.7 (3.0)

B. Refinement Statistics
   Rworking                                 0.186
   Rfree                                    0.228
   RMS deviation from ideal values
      Bond length (Å)                       0.014
      Bond angle (Deg)                      1.7
   Number of water molecules                445
   Average B-factor (Å²)
      Protein                               28.5
      Solvent                               32.2


Rmerge = $\sum_{hkl}\sum_{i} |I_{hkl,i} - \langle I_{hkl}\rangle| / \sum_{hkl} \langle I_{hkl}\rangle$,
where $I_{hkl,i}$ is the intensity of the ith measurement and
$\langle I_{hkl}\rangle$ is the weighted mean of all measurements of $I_{hkl}$.

Rworking (free) = $\sum ||F_o| - |F_c|| / \sum |F_o|$, where $F_o$ and $F_c$
are the observed and calculated structure factors. Numbers in parentheses
are the corresponding numbers for the highest resolution shell (2.00-1.9 Å).
Reflections with $F_o/\sigma(F_o) \geq 0.0$ are included in the refinement
and R factor calculation.
Memapsin 2 Crystal Structure.

The bilobal structure of memapsin 2 (Figure 7) is characteristic of
aspartic proteases (Tang, J., et al., Nature 271, 618-621 (1978)) with the
conserved folding of the globular core. The substrate binding cleft, where
the inhibitor is bound (Figure 7), is located between the two lobes. A pseudo
two-fold symmetry exists between the N- (residues 1-180) and C- (residues
181-385) lobes (Figure 7), which share 61 superimposable atoms with an
overall 2.3 Å rms deviation using a 4 Å cutoff. The corresponding numbers
for pepsin are 67 atoms and 2.2 Å. Active-site Asp32 and Asp228 and the
surrounding hydrogen-bond network are located in the center of the cleft
(Figure 7) and are conserved with the typical active-site conformation
(Davies, D. R., Annu. Rev. Biophys. Chem. 19, 189 (1990)). The active site
carboxyls are, however, not co-planar, and the degree of deviation (50°)
exceeds those observed previously.

Compared to pepsin, the conformation of the N-lobe is essentially
conserved (Sielecki et al., 1990). The most significant structural differences
are the insertions and a C-terminal extension in the C-lobe. Four insertions
in helices and loops (Figure 7) are located on the adjacent molecular
surface. Insertion F, which contains four acidic residues, is the most
negatively charged surface on the molecule. Together, these insertions
enlarge significantly the molecular boundary of memapsin 2 as compared
to pepsin (Figure 8). These surface structural changes may have a function in
the association of memapsin 2 with other cell surface components.
Insertions B and E are located on the other side of the molecule (Figure 7).
The latter contains a beta-strand that is paired with part of the C-terminal
extension G. A six-residue deletion occurs at position 329 on a loop facing
the flap on the opposite side of the active-site cleft, resulting in an
apparently more accessible cleft. Most of the C-terminal extension (residues
359-393) is in highly ordered structure. Residues 369-376 form a beta
structure with 7 hydrogen bonds to strand 293-299, while residues 378-383
form a helix (Figures 7 and 8). Two disulfide pairs (residues 155/359 and
217/382) unique to memapsin 2 fasten both ends of the extension region to
the C-lobe. This C-terminal extension is much longer than those observed
previously and is conformationally different [Cutfield, S. M., et al.,
Structure 3, 1261 (1995); Abad-Zapatero, C., et al., Protein Sci. 5, 640
(1996); Symersky, J. et al., Biochemistry 36, 12700 (1997); Yang, J., et al.,
Acta Crystallogr. D 55, 625 (1999)]. The last eight residues (386-393) are
not seen in the electron density map; they may form a connecting stem
between the globular catalytic domain and the membrane anchoring
domain.

Of the 21 putative pro residues only the last six, 43p-48p, are visible
in the electron density map. The remainder are likely mobile.
Pro-memapsin expressed in mammalian cell culture has an N-terminus position
at Glu33p. However, an Arg-Arg sequence present at residues 43p-44p is a
frequent signal for pro-protein processing, e.g., in prorenin (Corvol, P. et al.,
Hypertension 5, 13-9 (1983)). Recombinant memapsin 2 derived from this
cleavage is fully active. The mobility of residues 28p-42p suggests that they
are not part of the structure of mature memapsin 2.

Memapsin 2-OM99-2 Interaction. 

The binding of the eight-residue inhibitor OM99-2 in the active-site
cleft shares some structural features with other aspartic protease-inhibitor
complexes [Davies, D.R., Annu. Rev. Biophys. Chem. 19, 189 (1990);
Bailey and Cooper, (1994); Dealwis et al., (1994)]. These include four
hydrogen bonds between the two active-site aspartics and the hydroxyl of the
transition-state isostere, the covering of the flap (residues 69-75) over the
central part of the inhibitor and ten hydrogen bonds to the inhibitor backbone
(Figure 9). Most of the latter are highly conserved among aspartic proteases
[Davies, D. R., Annu. Rev. Biophys. Chem. 19, 189 (1990); Bailey and
Cooper, (1994); Dealwis et al., (1994)] except that hydrogen bonds to Gly11
and Tyr198 are unique to memapsin 2. These observations illustrate that the
transition-state template of memapsin 2 for the substrate peptide backbone
and its mechanism of catalysis are similar to those of other aspartic
proteases. These common features are, however, not the decisive factors in
the design of specific memapsin 2 inhibitors with high selectivity.
The observation important for the design of inhibitor drugs is that
the memapsin 2 residues in contact with individual inhibitor side chains
(Figure 9) are quite different from those for other aspartic proteases. These
side chain contacts are important for the design of tight binding inhibitors
with high selectivity. Five N-terminal residues of OM99-2 are in extended
conformation and, with the exception of P1' Ala, all have clearly defined
contacts (within 4 Å of an inhibitor side chain) with enzyme residues in the
active-site cleft (Figure 9).
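
The contact criterion used here, an enzyme atom within 4 Å of an inhibitor side-chain atom, is straightforward to apply to a coordinate set. The following sketch shows the idea with a brute-force distance search. The atom labels and coordinates are hypothetical placeholders and are not taken from the memapsin 2 structure; a real analysis would read atoms from the refined coordinate file.

```python
import math


def side_chain_contacts(inhibitor_atoms, enzyme_atoms, cutoff=4.0):
    """List (inhibitor label, enzyme label, distance) for atom pairs within cutoff (angstroms)."""
    contacts = []
    for i_label, i_xyz in inhibitor_atoms:
        for e_label, e_xyz in enzyme_atoms:
            distance = math.dist(i_xyz, e_xyz)
            if distance <= cutoff:
                contacts.append((i_label, e_label, round(distance, 2)))
    return contacts


# Hypothetical coordinates in angstroms, for illustration only.
INHIBITOR = [("P1 Leu CD1", (0.0, 0.0, 0.0)), ("P3 Val CG1", (3.0, 0.0, 0.0))]
ENZYME = [
    ("enzyme atom X", (0.0, 3.5, 0.0)),
    ("enzyme atom Y", (3.0, 0.0, 6.0)),
    ("enzyme atom Z", (5.0, 0.0, 0.0)),
]

if __name__ == "__main__":
    for inhibitor_label, enzyme_label, distance in side_chain_contacts(INHIBITOR, ENZYME):
        print(f"{inhibitor_label} -- {enzyme_label}: {distance} angstroms")
```

For full-size structures a spatial index would be faster, but the pairwise loop keeps the criterion explicit.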

The protease S4 subsite is mostly hydrophilic and open to solvent.
The position of the inhibitor P4 Glu side chain is defined by hydrogen bonds
to Gly11 and to P2 Asn (Figure 9) and the nearby side chains of Arg235 and
Arg307, which explains why the absence of this residue from OM99-2 causes
a 10-fold increase in Ki. Likewise, the protease S2 subsite is relatively
hydrophilic and open to solvent. The inhibitor P2 Asn side chain has hydrogen
bonds to P4 Glu and Arg235. The relatively small S2 residues Ser325 and
Ser327 (Gln and Met respectively in pepsin) may fit a side chain larger than
Asn. Memapsin 2 S1 and S3 subsites, which consist mostly of hydrophobic
residues, have conformations very different from pepsin due to the deletion
of pepsin helix hH2 (Dealwis, et al., (1994)). The inhibitor side chains of P3
Val and P1 Leu are closely packed against each other and have substantial
hydrophobic contacts with the enzyme (Figure 9); in particular, P3 interacts
with Tyr71 and Phe108. In the beta-secretase site of native APP, the P2 and
P1 residues are Lys and Met respectively. Swedish mutant APP has Asn and
Leu in these positions respectively, resulting in a 60-fold increase of kcat/Km
over that for native APP and an early onset of AD described by Mullan, M.,
et al. [Nat. Genet. 2, 340 (1992)]. The current structure suggests that
inhibitor P2 Lys would place its positive charge in an unfavorable
interaction with Arg235 with a loss of hydrogen bond to Arg235, while P1 Met
would have less favorable contact with memapsin 2 than does leucine in this
site (Figure 10). No close contact with memapsin 2 was seen for P1' Ala, and
an aspartic at this position, as in APP, may be accommodated by interacting
with Arg^".
The direction of the inhibitor chain turns at P2' and leads P3' and P4'
toward the protein surface (Figure 10). As a result, the side-chain position
of P2' Ala deviates from the regular extended conformation. The side chains
of P3' Glu and P4' Phe are both pointed toward the molecular surface with
little significant interaction with the protease (Figure 10). The relatively
high B-factors (58.2 Å² for Glu and 75.6 Å² for Phe) and less well-defined
electron density suggest that these two residues are relatively mobile, in
contrast to the defined structure of the S3' and S4' subsites in the
renin-inhibitor (CH-66) complex (Dealwis et al., 1994). The topologically
equivalent region of these renin subsites (residues 292-297 in pepsin
numbering) is deleted in memapsin 2. These observations suggest that the
conformation of three C-terminal residues of OM99-2 may be a functional
feature of memapsin 2, possibly a way to lead a long protein substrate out of
the active-site cleft.
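
The 60-fold difference in kcat/Km between the native and Swedish mutant cleavage sites can be checked against the kinetic constants recited in the claims, as best they can be read from the text: kcat/Km of 39.9 s⁻¹M⁻¹ for SEVKM/DAEFR, and kcat of 2.45 s⁻¹ with Km of 1 mM for SEVNL/DAEFR. The sketch below is simple arithmetic on those figures and makes no further assumptions.

```python
def catalytic_efficiency(kcat_per_s, km_molar):
    """kcat/Km in s^-1 M^-1."""
    return kcat_per_s / km_molar


NATIVE_KCAT_OVER_KM = 39.9                             # SEVKM/DAEFR, s^-1 M^-1
SWEDISH_KCAT_OVER_KM = catalytic_efficiency(2.45, 1e-3)  # SEVNL/DAEFR

if __name__ == "__main__":
    ratio = SWEDISH_KCAT_OVER_KM / NATIVE_KCAT_OVER_KM
    print(f"Swedish mutant kcat/Km: {SWEDISH_KCAT_OVER_KM:.0f} s^-1 M^-1")
    print(f"ratio to native: about {ratio:.0f}-fold")
```

The ratio comes out at roughly 61, in agreement with the 60-fold increase stated for the Swedish mutant sequence.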
Example 10: Using the Crystal Structure to Design Inhibitors.

Pharmaceutically acceptable inhibitor drugs normally have a size limit
under 800 daltons. In the case of memapsin 2 inhibitors, this requirement
may be even more stringent due to the need for the drugs to penetrate the
blood-brain barrier [Kearney and Aweeka, (1999)]. In the current model,
well defined subsite structures spanning P4 to P2' provide sufficient template
areas for rational design of such drugs. The spatial relationships of each
individual inhibitor side chain with the corresponding subsite of the enzyme,
as revealed in this crystal structure, permit the design of new inhibitor
structures in each of these positions. It is also possible to incorporate the
unique conformation of subsites P2', P3' and P4' into the selectivity of
memapsin 2 inhibitors. The examples of inhibitor design based on the current
crystal structure are given below.

Example A: Since the side chains of P3 Val and P1 Leu are packed against
each other and there is no enzyme structure between them, cross-linking
these side chains would increase the binding strength of inhibitor to
memapsin 2. This is because when binding to the enzyme, the cross-linked
inhibitors would have less entropy difference between the free and bound
forms than their non-cross-linked counterparts [Khan, A.R., et al.,
Biochemistry, 37, 16839 (1998)]. Possible structures of the cross-linked
side chains include those shown in Figure 11.

Example B: The same situation exists between the P4 Glu and P2 Asn. The
current crystal structure shows that these side chains are already hydrogen
bonded to each other, so cross-linking between them would also derive the
binding benefit described in Example A. The cross-linked structures
include those shown in Figure 12.

Example C: Based on the current crystal structure, the P1' Ala side chain
may be extended to add new hydrophobic, Van der Waals and H-bond
interactions. An example of such a design is diagramed in Figure 13.
Example D: Based on the current crystal structure, the polypeptide
backbone in the region of P2 and P3, and the side chain of P1-Leu can
be bridged into rings by the addition of two atoms (A and B in Figure 14).
Also, a methyl group can be added to the beta-carbon of the P1-Leu (Figure
14).
We claim:

1. Purified recombinant catalytically active memapsin 2.

2. The memapsin 2 of claim 1 having the amino acid sequence
of SEQ ID NO. 2 or the sequence present in a homologous species.

3. The memapsin 2 of claim 2 of human origin and having the
amino acid sequence of SEQ ID NO. 2.

4. The memapsin 2 of claim 1 not including the transmembrane
domain.

5. The memapsin 2 of claim 1 expressed in a bacteria.

6. The memapsin 2 of claim 1 cleaving SEVKM/DAEFR (SEQ
ID NO:4) and SEVNL/DAEFR (SEQ ID NO:5) at pH 4.0 with kcat/Km of less
than or equal to 39.9 s⁻¹M⁻¹ and less than or equal to kcat, 2.45 s⁻¹, Km, 1 mM;
kcat/Km, 2450 s⁻¹M⁻¹, respectively.

7. A method for producing catalytically active recombinant 
memapsin 2 comprising refolding the recombinant memapsin 2 under 
conditions which dissociate and then slowly refold the enzyme into a 
catalytically active form. 

8. The method of claim 7 wherein the memapsin 2 is first 
dissolved in 8 M urea solution including one or more reducing agents at a pH 
of greater than 8.0. 

9. The method of claim 8 wherein the memapsin 2 is then
diluted into an aqueous buffer like 20 mM Tris, pH 9.0, the pH slowly
adjusted to approximately 8 with 1 M HCl, and the solution maintained at
low temperature for approximately 24 to 48 hours before proceeding with
purification.

10. The method of claim 8 wherein the memapsin 2 is then
rapidly mixed with an aqueous buffer like 20 mM Tris, pH 9.0, containing
oxidized and reduced glutathione, the process repeated, then the urea
concentration decreased to approximately 0.4 M and the pH of the solution
slowly adjusted to 8.0.

11. The method of claim 8 wherein the memapsin 2 is dissolved
in 8 M urea, pH 10.0, then rapidly diluted into an aqueous buffer like 20 mM
Tris base, pH 9.0, and maintained at low temperature several hours,
maintained at room temperature for several hours, and then the process
repeated at decreasing pH.

12. A method of isolating inhibitors of cleavage by memapsin 2
comprising

adding to one or more potential inhibitors catalytically active
recombinant memapsin 2 cleaving SEVKM/DAEFR (SEQ ID NO:4) and
SEVNL/DAEFR (SEQ ID NO:5) at pH 4.0 with kcat/Km of less than or equal
to 39.9 s⁻¹M⁻¹ and less than or equal to kcat, 2.45 s⁻¹, Km, 1 mM; kcat/Km, 2450
s⁻¹M⁻¹, respectively, and a substrate for memapsin 2, and

screening for decreased cleavage of the substrate by the inhibitors.

13. The method of claim 12 wherein the inhibitors are in a library
of small synthetic molecules.

14. The method of claim 12 wherein the inhibitors are compounds
selected from the group consisting of proteins and peptides.

15. The method of claim 12 wherein the recombinant memapsin 2
is expressed in genetically engineered cells and the inhibitors and substrate
are added to the cells.

16. The method of claim 15 wherein the inhibitors are
oligonucleotides preventing or decreasing expression of catalytically active
memapsin 2.

17. The method of claim 14 wherein the compounds are isosteres
of the memapsin 2 active site defined by the presence of two catalytic
aspartic residues and substrate binding cleft.

18. The method of claim 12 further comprising identifying the
inhibitors maximally decreasing cleavage of substrate by the memapsin 2.

19. A method for designing or obtaining inhibitors of catalytically
active memapsin 2 comprising modeling an inhibitor based on the
crystallization coordinates of memapsin 2 or parameters of Table 2.

20. The method of claim 19 comprising using a computer
program to model a compound to determine its binding to the memapsin 2
active site defined by the presence of two catalytic aspartic residues and
substrate binding cleft.

21. The method of claim 19 comprising using a computer
program to design a compound which binds to the memapsin 2 active site
defined by the presence of two catalytic aspartic residues and substrate
binding cleft.

22. The method of claim 19 further comprising screening
compounds which bind to the active site defined by the presence of two
catalytic aspartic residues and substrate binding cleft for inhibition of
memapsin 2 catalytic activity.

23. A data base comprising binding properties and chemical 
structures of compounds designed or screened by modeling an inhibitor 
based on the crystallization coordinates of memapsin 2 or parameters of 
Table 2. 

24. A method of treating or preventing Alzheimer's disease
comprising administering to a patient in need thereof an
inhibitor of memapsin 2 which binds to the active site of the memapsin 2
defined by the presence of two catalytic aspartic residues and substrate
binding cleft. 

25. The method of claim 24 wherein the inhibitor has a Ki of less
than or equal to 10'^ M.

26. The method of claim 24 wherein the inhibitor is selected from 
the group consisting of proteins, peptides, oligonucleotides, and small 
synthetic molecules. 

27. The method of claim 24 wherein the inhibitor is modeled
based on the crystallization coordinates of memapsin 2 or parameters of
Table 2. 

28. A crystallized memapsin 2 comprising the active site defined
by the presence of two catalytic aspartic residues and substrate binding cleft.

29. The memapsin 2 of claim 28 not including the transmembrane 
and/or intracellular domains. 
30. The memapsin 2 of claim 28 having the parameters defined in
Table 2.

31. The memapsin 2 of claim 28 having a diffraction to at least
3.5 Å or less.

32. The memapsin 2 of claim 31 having a diffraction to at least 2
Å or less.

33. A method for treating or preventing Alzheimer's disease
comprising immunizing an individual in need thereof with catalytically
active memapsin 2 cleaving SEVKM/DAEFR (SEQ ID NO:4) and
SEVNL/DAEFR (SEQ ID NO:5) at pH 4.0 with kcat/Km of less than or equal
to 39.9 s⁻¹M⁻¹ and less than or equal to kcat 2.45 s⁻¹, Km 1 mM; kcat/Km 2450
s⁻¹M⁻¹, respectively, to elicit an effective amount of antibodies to reduce
cleavage by endogenous memapsin 2.
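
The second set of kinetic constants recited in claim 33 can be checked for internal consistency: dividing kcat by Km, with Km converted from mM to M, should give the stated kcat/Km. The following is a minimal Python sketch of that arithmetic only; the helper name `specificity_constant` is an illustrative assumption and does not come from the specification.

```python
def specificity_constant(kcat_per_s: float, km_molar: float) -> float:
    """Return kcat/Km in s^-1 M^-1 (hypothetical helper, not from the source)."""
    if km_molar <= 0:
        raise ValueError("Km must be positive")
    return kcat_per_s / km_molar


# Values recited in claim 33 for the second substrate, SEVNL/DAEFR (SEQ ID NO:5).
kcat = 2.45   # s^-1
km = 1e-3     # 1 mM expressed in mol/L

ratio = specificity_constant(kcat, km)
print(f"kcat/Km = {ratio:.0f} s^-1 M^-1")  # kcat/Km = 2450 s^-1 M^-1
```

The result agrees with the 2450 s⁻¹M⁻¹ recited in the claim.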
<210> 1

<211> 3252 

<212> DNA

<213> Homo sapiens

<400> 1 

gcgggagtgc tgcctgccca cggcacccag cacggcatcc ggctgcccct gcgcagcggc 60 
ctggggggcg cccccctggg gctgcggctg ccccgggaga ccgacgaaga gcccgaggag 120 
cccggccgga ggggcagctt tgtggagatg gtggacaacc tgaggggcaa gtcggggcag 180 
ggctactacg tggagatgac cgtgggcagc cccccgcaga cgctcaacat cctggtggat 240
acaggcagca gtaactttgc agtgggtgct gccccccacc ccttcctgca tcgctactac 300
cagaggcagc tgtccagcac ataccgggac ctccggaagg gtgtgtatgt gccctacacc 360 
cagggcaagt gggaagggga gctgggcacc gacctggtaa gcatccccca tggccccaac 420 
gtcactgtgc gtgccaacat tgctgccatc actgaatcag acaagttctt catcaacggc 480
tccaactggg aaggcatcct ggggctggcc tatgctgaga ttgccaggcc tgacgactcc 540 
ctggagcctt tctttgactc tctggtaaag cagacccacg ttcccaacct cttctccctg 600 
cagctttgtg gtgctggctt ccccctcaac cagtccgaag tgctggcctc tgtcggaggg 660 
agcatgatca ttggaggtat cgaccactcg ctgtacacag gcagtctctg gtatacaccc 720
atccggcggg agtggtatta tgaggtgatc attgtgcggg tggagatcaa tggacaggat 780
ctgaaaatgg actgcaagga gtacaactat gacaagagca ttgtggacag tggcaccacc 840
aaccttcgtt tgcccaagaa agtgttcgaa gctgcagtca aatccatcaa ggcagcctcc 900
tccacggaga agttccctga tggtttctgg ctaggagagc agctggtgtg ctggcaagca 960
ggcaccaccc cttggaacat tttcccagtc atctcactct acctaatggg tgaggttacc 1020
aaccagtcct tccgcatcac catccttccg cagcaatacc tgcggccagt ggaagatgtg 1080
gccacgtccc aagacgactg ttacaagttt gccatctcac agtcatccac gggcactgtt 1140
atgggagctg ttatcatgga gggcttctac gttgtctttg atcgggcccg aaaacgaatt 1200
ggcttcgctg tcagcgcttg ccatgtgcac gatgagttca ggacggcagc ggtggaaggc 1260
ccttttgtca ccttggacat ggaagactgt ggctacaaca ttccacagac agatgagtca 1320
accctcatga ccatagccta cgtcatggct gccatctgcg ccctcttcat gctgccactc 1380
tgcctcatgg tgtgtcagtg gcgctgcctc cgctgcctgc gccagcagca tgatgacttt 1440
gctgatgaca tctccctgct gaagtgagga ggcccatggg cagaagatag agattcccct 1500
ggaccacacc tccgtggttc actttggtca caagtaggag acacagatgg cacctgtggc 1560
cagagcacct caggaccctc cccacccacc aaatgcctct gccttgatgg agaaggaaaa 1620
ggctggcaag gtgggttcca gggactgtac ctgtaggaaa cagaaaagag aagaaagaag 1680
cactctgctg gcgggaatac tcttggtcac ctcaaattta agccgggaaa ctctgctgct 1740
cgaaacttca gccctgaacc tttgtccacc atccctttaa attctccaac ccaaagtatt 1800
cctcttctct tagcttcaga agtactggca tcacacgcag gttaccttgg cgtgtgtccc 1860
tgtggtaccc tggcagagaa gagaccaagc ttgtttccct gctggccaaa gtcagtagga 1920
gaggatgcac agtttgctat ttgcbttaga gacagggact gtataaacaa gcctaacatt 1980
ggtgcaaaga ttgcctcttg aattaaaaaa aaactagatt gactatttat acaaatgggg 2040
gcggccggaa agaggagaag gagagggagt acaaagacag ggaatagtgg gatcaaagct 2100
aggaaaggca gaaacacaac cactcaccag tcctagtttt agaccccatc tccaagatag 2160
catcccatct cagaagatgg gtgttgtttt caatgctttc ttttctgtgg ttgcagcctg 2220
accaaaagtg agatgggaag ggcttatcta gccaaagagc tcttttttag ctctcttaaa 2280
tgaagtgccc accaagaagt tccacttaac acatgaatct ccgccatatt aatttcattg 2340
tctctatctg aaccaccctt tattctacat atgataggca gcactgaaat atcctaaccc 2400
cctaagcccc aggtgccctg tgggagagca actggactat agcagggctg ggctctgtct 2460
tcctggtcat aggctcactc cctcccccaa atcttcctct ggagctttgc agccaaggtg 2520
ctaaaaggaa taggtaggag acctcttcta tctaatcctt aaaagcataa tgttgaacat 2580
tcattcaaca gctgatgccc cataacccct gcctggattt cttcctatta ggctataaga 2640
agtagcaaga tctttacata attcagagtg gtttcattgc cttcctaccc tctctaatgg 2700
cccctccatt tatttgacta aagcatcrca cagcggcact agcattatac caagagtatg 2760
agaaatacag tgccttatgg ctccaacatt actgccttca gtatcaaggc tgcctggaga 2820
aaggatggca gcctcagggc tcccttatgt cctccaccac aagagctcct tgatgaaggt 2880
catctttctc ccccatcccg tccctcccct ccccgctcct aatggtacgt gggtacccag 2940
gctggtcctt gggctaggta gtggggacca agttcattac ctccccatca gttctagcat 3000
agtaaaccac ggtaccagtg tcagtgggaa gagctgggtt ttcctagtat acccactgca 3060
tcctactcct acctggtcaa cccgctgctt ccaggtatgg gacctgctaa gtgtggaatt 3120
acctgataag ggagagggaa atacaaggag ggcctctggt gttcccggcc tcagccagct 3180
gcccmcaagc cataaaccaa taaamcaaga atactgagtc taaaaaaaaa aaaaaaaaaa 3240
aaaaaaaaaa aa 3252



<210> 2 
<211> 488 
<212> PRT

<213> Homo sapiens
<220>

<223> Purified Memapsin 2
<220> 

<223> Amino Acids 28-48 are remnant putative propeptide 
residues 

<220> 

<223> Amino Acids 58-61, 78, 80, 82-83, llff, 116-121,
156, 166, 174, 246, 274, 276, 278-281, 283, and
376-377 are residues in contact with the OM99-2 
inhibitor 

<220> 

<223> Amino acids 54-57, 61-68, 73-80, 86-89, 109-111,
113-118, 123-134, 143-154, 165-166, 198-202, and 
220-224 are N-lobe Beta Strands 

<220> 

<223> Amino Acids 184-191 and 210-217 are N-lobe Helices 
<220> 

<223> Amino acids 237-240, 247-249, 251-256, 259-260, 
273-275, 282-285, 316-318, 331-336, 342-348, 
354-357, 366-370, 372-375, 380-383, 390-395, 
400-405, and 418-420 are C-lobe Beta Strands 

<220> 

<223> Amino Acids 286-299, 307-310, 350-353, 384-387,
and 427-431 are C-lobe Helices 

<400> 2 

Ala Gly Val Leu Pro Ala His Gly Thr Gln His Gly Ile Arg Leu Pro
1 5 10 15

Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg
20 25 30

Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val
35 40 45

Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gln Gly Tyr Tyr Val
50 55 60

Glu Met Thr Val Gly Ser Pro Pro Gln Thr Leu Asn Ile Leu Val Asp
65 70 75 80

Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu
85 90 95

His Arg Tyr Tyr Gln Arg Gln Leu Ser Ser Thr Tyr Arg Asp Leu Arg
100 105 110

Lys Gly Val Tyr Val Pro Tyr Thr Gln Gly Lys Trp Glu Gly Glu Leu
115 120 125

Gly Thr Asp Leu Val Ser Ile Pro His Gly Pro Asn Val Thr Val Arg
130 135 140

Ala Asn Ile Ala Ala Ile Thr Glu Ser Asp Lys Phe Phe Ile Asn Gly
145 150 155 160

Ser Asn Trp Glu Gly Ile Leu Gly Leu Ala Tyr Ala Glu Ile Ala Arg
165 170 175

Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gln Thr
180 185 190

His Val Pro Asn Leu Phe Ser Leu Gln Leu Cys Gly Ala Gly Phe Pro
195 200 205

Leu Asn Gln Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met Ile Ile
210 215 220

Gly Gly Ile Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro
225 230 235 240

Ile Arg Arg Glu Trp Tyr Tyr Glu Val Ile Ile Val Arg Val Glu Ile
245 250 255

Asn Gly Gln Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys
260 265 270

Ser Ile Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val
275 280 285

Phe Glu Ala Ala Val Lys Ser Ile Lys Ala Ala Ser Ser Thr Glu Lys
290 295 300

Phe Pro Asp Gly Phe Trp Leu Gly Glu Gln Leu Val Cys Trp Gln Ala
305 310 315 320

Gly Thr Thr Pro Trp Asn Ile Phe Pro Val Ile Ser Leu Tyr Leu Met
325 330 335

Gly Glu Val Thr Asn Gln Ser Phe Arg Ile Thr Ile Leu Pro Gln Gln
340 345 350

Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gln Asp Asp Cys Tyr
355 360 365

Lys Phe Ala Ile Ser Gln Ser Ser Thr Gly Thr Val Met Gly Ala Val
370 375 380

Ile Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg Ile
385 390 395 400

Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala
405 410 415

Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr
420 425 430

Asn Ile Pro Gln Thr Asp Glu Ser Thr Leu Met Thr Ile Ala Tyr Val
435 440 445

Met Ala Ala Ile Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val
450 455 460

Cys Gln Trp Arg Cys Leu Arg Cys Leu Arg Gln Gln His Asp Asp Phe
465 470 475 480

Ala Asp Asp Ile Ser Leu Leu Lys
485



<210> 3 

<211> 503 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Pro-memapsin 2 
<220> 

<223> Amino Acids 1-15 are vector-derived residues 
<220> 

<223> Amino Acids 16-64 are a putative pro peptide 



<220> 
<223> Amino Acids 1-13 are the T7 promoter
<220>

<223> Amino Acids 16-456 are Pro-memapsin 2-T1
<220> 

<223> Amino Acids 16-421 are Promemapsin 2-T2 
<400> 3 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Gly Ser Met Ala
1 5 10 15

Gly Val Leu Pro Ala His Gly Thr Gln His Gly Ile Arg Leu Pro Leu
20 25 30

Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu
35 40 45

Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu
50 55 60

Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gln Gly Tyr Tyr Val Glu
65 70 75 80

Met Thr Val Gly Ser Pro Pro Gln Thr Leu Asn Ile Leu Val Asp Thr
85 90 95

Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His
100 105 110

Arg Tyr Tyr Gln Arg Gln Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys
115 120 125

Gly Val Tyr Val Pro Tyr Thr Gln Gly Lys Trp Glu Gly Glu Leu Gly
130 135 140

Thr Asp Leu Val Ser Ile Pro His Gly Pro Asn Val Thr Val Arg Ala
145 150 155 160

Asn Ile Ala Ala Ile Thr Glu Ser Asp Lys Phe Phe Ile Asn Gly Ser
165 170 175

Asn Trp Glu Gly Ile Leu Gly Leu Ala Tyr Ala Glu Ile Ala Arg Pro
180 185 190

Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gln Thr His
195 200 205

Val Pro Asn Leu Phe Ser Leu Gln Leu Cys Gly Ala Gly Phe Pro Leu
210 215 220

Asn Gln Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met Ile Ile Gly
225 230 235 240

Gly Ile Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro Ile
245 250 255

Arg Arg Glu Trp Tyr Tyr Glu Val Ile Ile Val Arg Val Glu Ile Asn
260 265 270

Gly Gln Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser
275 280 285

Ile Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe
290 295 300

Glu Ala Ala Val Lys Ser Ile Lys Ala Ala Ser Ser Thr Glu Lys Phe
305 310 315 320

Pro Asp Gly Phe Trp Leu Gly Glu Gln Leu Val Cys Trp Gln Ala Gly
325 330 335

Thr Thr Pro Trp Asn Ile Phe Pro Val Ile Ser Leu Tyr Leu Met Gly
340 345 350

Glu Val Thr Asn Gln Ser Phe Arg Ile Thr Ile Leu Pro Gln Gln Tyr
355 360 365

Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gln Asp Asp Cys Tyr Lys
370 375 380

Phe Ala Ile Ser Gln Ser Ser Thr Gly Thr Val Met Gly Ala Val Ile
385 390 395 400

Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg Ile Gly
405 410 415

Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala
420 425 430

Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn
435 440 445

Ile Pro Gln Thr Asp Glu Ser Thr Leu Met Thr Ile Ala Tyr Val Met
450 455 460

Ala Ala Ile Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys
465 470 475 480

Gln Trp Arg Cys Leu Arg Cys Leu Arg Gln Gln His Asp Asp Phe Ala
485 490 495

Asp Asp Ile Ser Leu Leu Lys
500

<210> 4 
<211> 10 
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 4

Ser Glu Val Lys Met Asp Ala Glu Phe Arg
1 5 10

<210> 5
<211> 10
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
Peptide

<400> 5

Ser Glu Val Asn Leu Asp Ala Glu Phe Arg
1 5 10

<210> 6 
<211> 8
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
Peptide

<400> 6

Ser Val Asn Met Ala Glu Gly Asp
1 5



<210> 7 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
Peptide

<400> 7

Lys Gly Gly Val Val Ile Ala Thr Val Ile Val Lys
1 5 10



<210> 8 
<211> 4 
<212> PRT 

<213> Homo sapiens
<400> 8

Asp Thr Ser Gly
1 



<210> 9
<211> 8
<212> PRT

<213> Homo sapiens 
<400> 9 

Leu Val Asn Met Ala Glu Gly Asp 
1 5 



<210> 10 
<211> 28 
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 10

ggtaagcatc ccccatggcc ccaacgtc 28
<210> 11
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 11 

gacgttgggg ccatggggga tgcttacc 

<210> 12 
<211> 34 
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 12

acgttgtctt tgatcgggcc cgaaaacgaa ttgg 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer
<400> 13 

ccaattcgtt ttcgggcccg atcaaagaca acg 

<210> 14 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer
<400> 14 

ccatcctaat acgactcact atagggc 
<210> 15
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 15

actcactata gggctcgagc ggc

<210> 16
<211> 26
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 16 

cttttgagca agttcagcct ggttaa 

<210> 17 
<211> 31 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

gaggtggctt atgagtattt cttccagggt a 

<210> 18 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer
<400> 18

tggcgacgac tcctggagcc cg
<210> 19
<211> 24

<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 19

tgacaccaga ccaactggta atgg 

<210> 20
<211> 27
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 20 

catatggcgg gagtgctgcc tgcccac 

<210> 21 
<211> 38
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer
<400> 21 

ggatcctcac ttcagcaggg agatgtcatc agcaaagt 

<210> 22 
<211> 8
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Oxidized
Insulin B-chain

<220>

<223> Xaa at site 3 represents cysteic acid
<400> 22

His Leu Xaa Gly Ser His Leu Val
1 5 

<210> 23 
<211> 8 
<212> PRT 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Oxidized
Insulin B-chain

<220>

<223> Xaa at site 1 represents cysteic acid
<400> 23

Xaa Gly Glu Arg Gly Phe Phe Tyr
1 5 

<210> 24 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 24 

Val Gly Ser Gly Val 
1 5 

<210> 25 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Peptide 

<400> 25 

Val Gly Ser Gly Val Leu Leu 
1 5



<210> 26 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
Peptide

<400> 26 

Gly Val Leu Leu Ser Arg Lys 
1 5 



<210> 27 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inhibitors
<400> 27 

Val Asn Leu Ala Ala Glu Phe 
1 5 



<210> 28 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inhibitors 
<400> 28

Glu Val Asn Leu Ala Ala Glu Phe
1 5



<210> 29 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic
Peptide

<400> 29
Asn Leu Ala Ala
1 

<210> 30 
<211> 10 
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
Peptide 

<400> 30 

Val Gly Ser Gly Val Leu Leu Ser Arg Lys 
1 5 10 

<210> 31 

<211> 326 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Amino acids 2-5, 6-5, 13-20, 25-32, 55-67, 69-74,
79-87, 89-91, 99-106, 119-122, 150-154, 164-167,
180-183, 191-194, 196-199, 201-204, 210-214,
221-223, 258-262, 265-269, and 275-278 are Beta
Strands

<220>

<223> Amino acids 281-284, 286-288, 298-301, 310-315,
and 319-324 are Beta strands

<220>

<223> Amino acids 48-51, 111-114, 136-142, 225-234,
249-254, 271-274, and 303-306 are Helices

<220>

<223> Amino acids 12-13, 30, 32, 34-35, 73-77, 111, 117,
120, 199, 213, 215, 217-220, 287, 289, 291, 298,
and 300 are residues in contact with pepstatin.
<220>

<223> Pepsin 
<400> 31 

Val Asp Glu Gln Pro Leu Glu Asn Tyr Leu Asp Met Glu Tyr Phe Gly
1 5 10 15

Thr Ile Gly Ile Gly Thr Pro Ala Gln Asp Phe Thr Val Val Phe Asp
20 25 30

Thr Gly Ser Ser Asn Leu Trp Val Pro Ser Val Tyr Cys Ser Ser Leu
35 40 45

Ala Cys Thr Asn His Asn Arg Phe Asn Pro Glu Asp Ser Ser Thr Tyr
50 55 60

Gln Ser Thr Ser Glu Thr Val Ser Ile Thr Tyr Gly Thr Gly Ser Met
65 70 75 80

Thr Gly Ile Leu Gly Tyr Asp Thr Val Gln Val Gly Gly Ile Ser Asp
85 90 95

Thr Asn Glu Ile Phe Gly Leu Ser Glu Thr Glu Pro Gly Ser Phe Leu
100 105 110

Tyr Tyr Ala Pro Phe Asp Gly Ile Leu Gly Leu Ala Tyr Pro Ser Ile
115 120 125

Ser Ser Ser Gly Ala Thr Pro Val Phe Asp Asn Ile Trp Asn Gln Gly
130 135 140

Leu Val Ser Gln Asp Leu Phe Ser Val Tyr Leu Ser Ala Asp Asp Gln
145 150 155 160

Ser Gly Ser Val Val Ile Phe Gly Gly Ile Asp Ser Ser Tyr Tyr Thr
165 170 175

Gly Ser Leu Asn Trp Val Pro Val Thr Val Glu Gly Tyr Trp Gln Ile
180 185 190

Thr Val Asp Ser Ile Thr Met Asn Gly Glu Ala Ile Ala Cys Ala Glu
195 200 205

Gly Cys Gln Ala Ile Val Asp Thr Gly Thr Ser Leu Leu Thr Gly Pro
210 215 220

Thr Ser Pro Ile Ala Asn Ile Gln Ser Asp Ile Gly Ala Ser Glu Asn
225 230 235 240

Ser Asp Gly Asp Met Val Val Ser Cys Ser Ala Ile Ser Ser Leu Pro
245 250 255

Asp Ile Val Phe Thr Ile Asn Gly Val Gln Tyr Pro Val Pro Pro Ser
260 265 270

Ala Tyr Ile Leu Gln Ser Glu Gly Ser Cys Ile Ser Gly Phe Gln Gly
275 280 285

Met Asn Leu Pro Thr Glu Ser Gly Glu Leu Trp Ile Leu Gly Asp Val
290 295 300

Phe Ile Arg Gln Tyr Phe Thr Val Phe Asp Arg Ala Asn Asn Gln Val
305 310 315 320

Gly Leu Ala Pro Val Ala
325